High tryptophan maize

ABSTRACT

The present invention provides a method for altering the tryptophan content of a plant by introducing and expressing an artificial polynucleotide sequence encoding an anthranilate synthase in the cells of the plant. Transgenic plants transformed with an artificial polynucleotide sequence encoding an anthranilate synthase, as well as human or animal food, seeds, and progeny derived from these plants, are also provided.

This application is a Continuation-in-Part of U.S. application Ser. No. 10/430,011, filed May 5, 2003, which claims the priority of U.S. Provisional Application Ser. No. 60/377,727, filed May 3, 2002; each of which applications is incorporated herein by reference.

INCORPORATION OF SEQUENCE LISTING

A computer readable form of the sequence listing is contained in the file named “MONS109US.txt” which is 351 kb (measured in MS-Windows) and was created on Aug. 11, 2006 and is located on a CDROM, which is filed herewith and is herein incorporated by reference.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates generally to methods and compositions for improving expression of heterologous genes in order to enhance the nutritional content of plants.

2. Description of Related Art

The seeds of a number of important crops, including soybean and maize, do not contain sufficient quantities of several amino acids to be nutritionally complete. These amino acids include, but are not limited to: tryptophan, isoleucine, valine, arginine, lysine, methionine, and threonine. Therefore, the biosynthetic pathways for these amino acids, and/or biosynthetic pathways for metabolites that feed into those pathways, are potential targets for manipulation in order to increase the amino acid content of these plants.

Anthranilate synthase (AS, EC 4.1.3.27) catalyzes the first reaction branching from the aromatic amino acid pathway to the biosynthesis of tryptophan in plants, fungi, and bacteria. The most common form of anthranilate synthase (for example, the maize anthranilate synthase) is a heterotetrameric enzyme consisting of two subunits, the α or TrpE subunit and the β or TrpG subunit. Two α-subunits and two β-subunits assemble to form the heterotetrameric anthranilate synthases. “Monomeric” forms of AS have also been discovered that comprise a single polypeptide chain having the activities of both TrpE and TrpG subunits (for example from Rhizobium meliloti). While monomeric anthranilate synthases comprise just one type of polypeptide, the enzymatically active form of a monomeric anthranilate synthase is typically a homodimer consisting of two such monomeric polypeptides. Both heterotetrameric and monomeric anthranilate synthases catalyze the formation of anthranilate in a reaction utilizing glutamine and chorismate. The domain found on the α-subunit (referred to herein as the “α-domain”) binds chorismate and eliminates the enolpyruvate side chain, and the domain found on the β-subunit (referred to herein as the “β-domain”) transfers an amino group from glutamine to the position on the chorismate phenyl ring that resides between the carboxylate and the enolpyruvate moieties.

The next reaction in the synthesis of tryptophan is the transfer of the phosphoribosyl moiety of phosphoribosyl pyrophosphate to anthranilate. The indole ring is formed in two steps involving an isomerization converting the ribose group to a ribulose followed by a cyclization reaction to yield indole glycerol phosphate. The final reaction in the pathway is catalyzed by a single enzyme that may contain either one or two subunits. The reaction accomplishes the cleavage of indole glycerol phosphate and condensation of the indole group with serine (Umbarger, 1978).

Metabolite flow in the tryptophan pathway in higher plants and microorganisms is apparently regulated through feedback inhibition of anthranilate synthase by tryptophan. Tryptophan may block the conformational rearrangement that is required to activate the β-domain and to create a channel for passage of ammonia toward the active site of the α-domain. Such feedback inhibition by tryptophan is believed to depress the production of tryptophan by anthranilate synthase (see Li and Last, 1996).

Several amino acid residues have been identified as being involved in the feedback regulation of the anthranilate synthase complex from Salmonella typhimurium. Such information provides evidence of an amino-terminal regulatory site (Caligiuri and Bauerle, 1991). Niyogi et al. have further characterized the anthranilate synthase from certain plants employing a molecular approach (Niyogi and Fink, 1992 and Niyogi et al., 1993). They found that the α-subunits of the Arabidopsis anthranilate synthase are encoded by two closely related, nonallelic genes that are differentially regulated. One of these α-subunit genes, ASA1, is induced by wounding and bacterial pathogen infiltration, implicating its involvement in a defense response, whereas the other α-subunit gene, ASA2, is expressed at constitutive basal levels. Both predicted proteins share regions of homology with bacterial and fungal anthranilate synthase proteins, and contain conserved amino acid residues at positions that have been shown to be involved in tryptophan feedback inhibition in bacteria (Caligiuri et al., 1991).

Amino acid analogs of tryptophan and analogs of the intermediates in the tryptophan biosynthetic pathway (e.g., 5-methyltryptophan, 4-methyltryptophan, 5-fluorotryptophan, 5-hydroxytryptophan, 7-azatryptophan, 3β-indoleacrylic acid, 3-methylanthranilic acid), have been shown to inhibit the growth of both prokaryotic and eukaryotic organisms. Plant cell cultures can be selected for resistance to these amino acid analogs. For example, cultured tobacco, carrot, potato, corn and Datura innoxia cell lines have been selected that are resistant to growth inhibition by 5-methyltryptophan (5-MT), an amino acid analog of tryptophan, due to expression of an altered anthranilate synthase.

Ranch et al. (1983) selected for 5-MT resistance in cell cultures of Datura innoxia, a dicot weed, and reported that the resistant cell cultures contained increased tryptophan levels (8 to 30 times higher than the wild type level) and an anthranilate synthase with less sensitivity to tryptophan feedback inhibition. Regenerated plants were also resistant to 5-MT, contained an altered anthranilate synthase, and had greater concentrations of free tryptophan (4 to 44 times) in the leaves than did the leaves of the control plants. In contrast to the studies with N. tabacum, where the altered enzyme was not expressed in plants regenerated from resistant cell lines, these results indicated that the amino acid overproduction phenotype could be selected at the cellular level and expressed in whole plants regenerated from the selected cells in Datura innoxia.

Hibberd et al. (U.S. Pat. No. 4,581,847) described 5-MT resistant maize cell lines that contained an anthranilate synthase that was less sensitive to feedback inhibition than wild-type anthranilate synthase. One 5-MT resistant cell line accumulated free tryptophan at levels almost twenty-fold greater than that of non-transformed cell lines.

Anderson et al. (U.S. Pat. No. 6,118,047) disclose the use of a tryptophan-insensitive α-domain of anthranilate synthase from C28 maize in a transgene to prepare transgenic maize plants (Zea mays) exhibiting elevated levels of free tryptophan in the seed(s).

Although it is possible to select for 5-MT resistance in certain cell cultures and plants, this characteristic does not necessarily correlate with the overproduction of free tryptophan in whole plants. Additionally, plants regenerated from 5-MT resistant lines frequently do not express an altered form of the enzyme. Nor is it predictable that this characteristic will be stable over a period of time and will be passed along as a heritable trait.

Anthranilate synthase has also been partially purified from crude extracts of cell cultures of higher plants (Hankins et al., 1976; Widholm, 1973). However, it was found to be very unstable. Thus, there is a need to provide plants with a source of anthranilate synthase that can increase the tryptophan content of plants.

SUMMARY OF THE INVENTION

The present invention provides nucleic acids encoding an anthranilate synthase (AS) that can be used to generate transgenic plants. When such anthranilate synthase nucleic acids are expressed in a transgenic plant, elevated levels of tryptophan can be achieved within the cells of the plant. In one embodiment, the present invention is directed to DNA molecules that encode a monomeric anthranilate synthase, where such a monomeric anthranilate synthase is a natural or genetically engineered chimeric fusion of the α- and β-domains of an anthranilate synthase. The anthranilate synthase gene from a few species (e.g., some bacteria and other microbes) naturally gives rise to a monomeric anthranilate synthase that constitutes a single polypeptide chain. However, most species have a heterotetrameric anthranilate synthase composed of two α- and two β-domains found on separate subunits. The present invention also contemplates formation of chimeric anthranilate synthase fusion proteins comprising any anthranilate synthase α-domain linked to any β-domain.

In general, the sequence identity of naturally occurring monomeric anthranilate synthases with most plant anthranilate synthases is quite low. However, according to the invention, such monomeric anthranilate synthases can provide high levels of tryptophan when expressed in a plant, despite a low sequence identity with the plant's endogenous anthranilate synthase enzyme. Accordingly, the present invention provides monomeric anthranilate synthases that can have divergent sequences and that are capable of efficiently providing high levels of tryptophan in a plant host. For example, transgenic soybean plants containing the monomeric Agrobacterium tumefaciens anthranilate synthase can produce up to about 10,000 to about 12,000 ppm tryptophan in seeds, with average tryptophan levels ranging up to about 7,000 to about 8,000 ppm. In contrast, non-transgenic soybean plants normally have up to only about 100 to about 200 ppm tryptophan in seeds.
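As a rough illustration only, the seed tryptophan figures above can be expressed as fold increases over non-transgenic seed. The following Python sketch divides the quoted ranges; the function name fold_range is hypothetical, and because every quoted value is an approximate upper bound ("up to about"), the resulting ratios are indicative rather than measured.

```python
def fold_range(transgenic_ppm, control_ppm):
    """Return the (smallest, largest) fold increase implied by two (low, high) ppm ranges."""
    t_low, t_high = transgenic_ppm
    c_low, c_high = control_ppm
    # Smallest ratio: lowest transgenic value over highest control value.
    # Largest ratio: highest transgenic value over lowest control value.
    return t_low / c_high, t_high / c_low


# Ranges quoted in the text, in ppm tryptophan in soybean seed.
peak = fold_range((10_000, 12_000), (100, 200))
average = fold_range((7_000, 8_000), (100, 200))

print(peak)     # (50.0, 120.0)
print(average)  # (35.0, 80.0)
```

Under these assumptions the quoted peak levels correspond to roughly a 50-fold to 120-fold increase, and the quoted average levels to roughly a 35-fold to 80-fold increase.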

Accordingly, the present invention provides an isolated DNA sequence encoding a monomeric anthranilate synthase, wherein the monomeric anthranilate synthase has an anthranilate α-domain and an anthranilate β-domain and wherein the monomeric anthranilate synthase is expressed in a plant. Such expression can elevate the level of L-tryptophan in the plant.

The monomeric anthranilate synthase can be naturally monomeric. Examples of organisms from which naturally monomeric anthranilate synthase nucleic acids may be isolated, include but are not limited to organisms such as Agrobacterium tumefaciens, Rhizobium meliloti (e.g., Genbank Accession No. GI 95177), Mesorhizobium loti (e.g., Genbank Accession No. GI 13472468), Brucella melitensis (e.g., Genbank Accession No. GI 17982357), Nostoc sp. PCC7120 (e.g., Genbank Accession Nos. GI 17227910 or GI 17230725), Azospirillum brasilense (e.g., Genbank Accession No. GI 1174156) and Anabaena M22983 (e.g., Genbank Accession No. GI 152445). In some embodiments, the isolated DNA encodes an Agrobacterium tumefaciens anthranilate synthase having, for example, an amino acid sequence having SEQ ID NO: 4 or a nucleotide sequence having any one of SEQ ID NOs: 1 or 75.

Alternatively, the monomeric anthranilate synthase can be a fusion of any available anthranilate synthase α and β domain. Such α and β domains can be derived from Zea mays, Ruta graveolens, Sulfolobus solfataricus, Salmonella typhimurium, Serratia marcescens, Escherichia coli, Agrobacterium tumefaciens, Arabidopsis thaliana, Rhizobium meliloti (e.g., Genbank Accession No. GI 95177), Mesorhizobium loti (e.g., Genbank Accession No. GI 13472468), Brucella melitensis (e.g., Genbank Accession No. GI 17982357), Nostoc sp. PCC7120 (e.g., Genbank Accession No. GI 17227910 or GI 17230725), Azospirillum brasilense (e.g., Genbank Accession No. GI 1174156) and Anabaena M22983 (e.g., Genbank Accession No. GI 152445), soybean, rice, cotton, wheat, tobacco or any gene encoding α subunit or domain of anthranilate synthase. For example, nucleic acids encoding such an α or β domain can be obtained by using the sequence information in any of SEQ ID NOs: 1-70, 75-113, and 116-137.

In another embodiment, the invention provides an isolated DNA encoding an α domain of anthranilate synthase from Zea mays that comprises SEQ ID NOs: 5 or 66. Such an isolated DNA can have nucleotide sequence SEQ ID NOs: 2, 67, or 68. The isolated DNA can be operably linked to a promoter and, when expressed in a plant can provide elevated levels of L-tryptophan in the plant.

In yet another embodiment, the invention provides an isolated DNA molecule encoding an anthranilate synthase wherein the DNA molecule encodes a protein substantially homologous to an anthranilate synthase protein exemplified by SEQ ID NOs: 66, 108-111, 133, and 137. The isolated DNA encoding an anthranilate synthase comprises a DNA molecule substantially homologous to a DNA molecule exemplified by SEQ ID NOs: 67, 68, 104-107, and 134-136.

In still another embodiment, the present invention provides a DNA construct comprising an expression cassette, wherein the expression cassette in operable linkage comprises (i) a heterologous promoter; (ii) a DNA molecule encoding a monomeric anthranilate synthase protein, wherein the monomeric anthranilate synthase comprises a single polypeptide comprising an anthranilate synthase α-domain and an anthranilate synthase β-domain, and (iii) a transcriptional terminator. The monomeric anthranilate synthase protein may comprise a protein substantially homologous to proteins exemplified by SEQ ID NOs: 4, 7, 43, 57, 77-82, and 130-132. The DNA molecule may comprise a DNA molecule substantially homologous to a DNA molecule exemplified by SEQ ID NOs: 1, 75, 76, 83, and 121-129.

In a further embodiment, the present invention provides a DNA construct comprising a first expression cassette, wherein the first expression cassette in operable linkage comprises (i) a heterologous promoter; (ii) a DNA molecule encoding an anthranilate synthase α-domain protein and (iii) a transcriptional terminator.

The above DNA construct may further comprise a second expression cassette in operable linkage comprising (i) a heterologous promoter; (ii) a DNA molecule encoding an anthranilate synthase β-domain protein and (iii) a transcriptional terminator. The DNA construct may comprise an α-domain or β-domain protein substantially homologous to a protein exemplified by SEQ ID NOs: 5, 6, 8, 44, 45, 66, 99, 100, 101, 102, 103, 108, 109, 110, 111, 117, 118, 133, or 137. The DNA molecule encoding an anthranilate synthase α-domain or β-domain protein may comprise a DNA molecule substantially homologous to a DNA molecule exemplified by SEQ ID NOs: 2, 3, 67, 94, 95, 96, 97, 98, 104, 105, 106, 112, 116, 119, 120, 134, 135, or 136. A specific example comprises a DNA construct where the α-domain anthranilate synthase protein is SEQ ID NO: 66 and the β-domain protein is SEQ ID NO: 118.
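The two-cassette arrangement described above can be pictured as a simple data structure. The Python sketch below is illustrative only: the class names, the promoter and terminator labels, and the text layout produced by describe() are assumptions introduced here and are not elements of the disclosure. Only the pairing of an α-domain protein of SEQ ID NO: 66 with a β-domain protein of SEQ ID NO: 118 is taken from the text.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class ExpressionCassette:
    """Three elements in operable linkage: promoter, coding region, terminator."""
    promoter: str
    coding_region: str
    terminator: str

    def elements(self) -> List[str]:
        # Order reflects 5' to 3' linkage: (i) promoter, (ii) coding DNA, (iii) terminator.
        return [self.promoter, self.coding_region, self.terminator]


@dataclass(frozen=True)
class DnaConstruct:
    """A construct holding one or more expression cassettes."""
    cassettes: Tuple[ExpressionCassette, ...]

    def describe(self) -> str:
        return " | ".join(" > ".join(c.elements()) for c in self.cassettes)


# Hypothetical labels; only the SEQ ID NO pairing comes from the text.
alpha = ExpressionCassette("promoter_1", "encodes SEQ ID NO: 66", "terminator_1")
beta = ExpressionCassette("promoter_2", "encodes SEQ ID NO: 118", "terminator_2")
construct = DnaConstruct((alpha, beta))
print(construct.describe())
```

Modelling each cassette as an immutable record keeps the promoter, coding region, and terminator together as one unit, which mirrors the requirement that the three elements be in operable linkage.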

The isolated DNA can also encode a mutant anthranilate synthase, or a mutant anthranilate synthase domain. Such a mutant anthranilate synthase, or domain thereof, can have one or more mutations. As is known to one of skill in the art, mutations can be silent, can give rise to variant gene products having enzymatic activity similar to wild type or can give rise to derivative gene products that have altered enzymatic activity. The present invention contemplates all such mutations.

The mutated isolated DNA can be generated from a wild type anthranilate synthase nucleic acid either in vitro or in vivo and can encode, for example, one or more amino acid substitutions, deletions or insertions. Mutant isolated DNAs that generate a mutant anthranilate synthase having increased activity, greater stability, or less sensitivity to feedback inhibition by tryptophan or tryptophan analogs are desirable. In one embodiment, the anthranilate synthase, or a domain thereof, is resistant to inhibition by endogenous L-tryptophan or by tryptophan analogs. For example, the anthranilate synthase can have one or more mutations in the tryptophan-binding pocket or elsewhere that reduces the sensitivity of the anthranilate synthase, or the domain thereof, to tryptophan inhibition. Among the amino acid residues contemplated for mutation are residues, for example, at about positions 48, 51, 52, 293, and 298. For example, the mutation can be:

a) at about position 48 replace Val with Phe;

b) at about position 48 replace Val with Tyr;

c) at about position 51 replace Ser with Phe;

d) at about position 51 replace Ser with Cys;

e) at about position 52 replace Asn with Phe;

f) at about position 293 replace Pro with Ala;

g) at about position 293 replace Pro with Gly; or

h) at about position 298 replace Phe with Trp;

wherein the position of the mutation is determined by alignment of the amino acid sequence of the selected anthranilate synthase with an Agrobacterium tumefaciens anthranilate synthase amino acid sequence. Examples of anthranilate synthases having such mutations include those with SEQ ID NOs: 58-65, 69, 70, and 84-94.
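The substitutions listed above can be represented programmatically as (position, wild-type residue, replacement residue) triples. The following Python sketch is a minimal illustration under stated assumptions: the short names such as "V48F", the helper functions, and the placeholder sequence are all hypothetical, and the placeholder is not the Agrobacterium tumefaciens sequence. Because the text gives positions as "about", a real sequence would first need to be aligned to confirm each position.

```python
# Substitutions listed in the text: (1-based position, wild-type residue, replacement).
SUBSTITUTIONS = {
    "V48F": (48, "V", "F"),
    "V48Y": (48, "V", "Y"),
    "S51F": (51, "S", "F"),
    "S51C": (51, "S", "C"),
    "N52F": (52, "N", "F"),
    "P293A": (293, "P", "A"),
    "P293G": (293, "P", "G"),
    "F298W": (298, "F", "W"),
}


def placeholder_sequence(length=300):
    """Build a made-up sequence that carries the listed wild-type residues.

    This is a stand-in for demonstration and is NOT a real anthranilate synthase.
    """
    residues = ["A"] * length
    for position, wild_type, _ in SUBSTITUTIONS.values():
        residues[position - 1] = wild_type
    return "".join(residues)


def apply_substitution(sequence, position, wild_type, replacement):
    """Replace one residue at a 1-based position after checking the expected residue."""
    index = position - 1
    if not 0 <= index < len(sequence):
        raise ValueError(f"position {position} is outside the sequence")
    if sequence[index] != wild_type:
        raise ValueError(
            f"expected {wild_type} at position {position}, found {sequence[index]}"
        )
    return sequence[:index] + replacement + sequence[index + 1:]


wild_type_seq = placeholder_sequence()
mutant = apply_substitution(wild_type_seq, *SUBSTITUTIONS["V48F"])
print(wild_type_seq[47], "->", mutant[47])  # V -> F
```

Checking the wild-type residue before substituting guards against applying a mutation at a misnumbered position.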

Alternatively, the DNA that encodes a mutant anthranilate synthase can be an artificial polynucleotide having one or more mutations in the tryptophan-binding pocket or elsewhere that reduces the sensitivity of the anthranilate synthase, or the domain thereof, to tryptophan inhibition. Among the amino acid residues contemplated for mutation are residues, for example, at about positions 48, 51, 52, 293, and 298. For example, the mutation can be:

a) at about position 48 replace Val with Phe;

b) at about position 48 replace Val with Tyr;

c) at about position 51 replace Ser with Phe;

d) at about position 51 replace Ser with Cys;

e) at about position 52 replace Asn with Phe;

f) at about position 293 replace Pro with Ala;

g) at about position 293 replace Pro with Gly; or

h) at about position 298 replace Phe with Trp;

wherein the position of the mutation is determined by alignment of the amino acid sequence of the selected anthranilate synthase with an Agrobacterium tumefaciens anthranilate synthase (AgroAS) amino acid sequence. Examples of anthranilate synthases having such mutations include the polypeptides of SEQ ID NOs:60-61, encoded by the nucleic acid sequences of SEQ ID NOs:144-145.
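Because the mutation positions are defined by alignment with the AgroAS sequence, locating the equivalent residue in another anthranilate synthase requires mapping a reference position through a pairwise alignment. The Python sketch below shows one way this mapping might be done, assuming the alignment has already been computed and is supplied as two equal-length strings with "-" marking gaps. The function name and the short toy sequences are hypothetical and do not correspond to any sequence in the sequence listing.

```python
def map_reference_position(aligned_ref, aligned_query, ref_position):
    """Map a 1-based ungapped reference position onto the query sequence.

    Returns (query_position, query_residue), or None when the reference
    residue is aligned to a gap in the query.
    """
    if len(aligned_ref) != len(aligned_query):
        raise ValueError("aligned sequences must have equal length")
    ref_count = 0
    query_count = 0
    for ref_char, query_char in zip(aligned_ref, aligned_query):
        if ref_char != "-":
            ref_count += 1
        if query_char != "-":
            query_count += 1
        if ref_char != "-" and ref_count == ref_position:
            if query_char == "-":
                return None
            return query_count, query_char
    raise ValueError("ref_position is beyond the end of the reference")


# Toy alignment (hypothetical): the query has one insertion and one deletion.
aligned_ref = "MK-TVLS"
aligned_query = "MKQT-LS"

print(map_reference_position(aligned_ref, aligned_query, 3))  # (4, 'T')
print(map_reference_position(aligned_ref, aligned_query, 4))  # None
print(map_reference_position(aligned_ref, aligned_query, 5))  # (5, 'L')
```

In this toy case, reference position 3 corresponds to query position 4 because of the insertion, which illustrates why positions in the selected enzyme are described as being at "about" the stated AgroAS positions.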

The isolated DNA can encode other elements and functions. Any element or function contemplated by one of skill in the art can be included. For example, the isolated DNA can also include a promoter that can function in a plant cell that is operably linked to the DNA encoding the anthranilate synthase. The isolated DNA can further encode a plastid transit peptide. The isolated DNA can also encode a selectable marker or a reporter gene. Such a selectable marker gene can impart herbicide resistance to cells of said plant, high protein content, high oil content, high lysine content, high isoleucine content, high tocopherol content and the like. The DNA sequence can also comprise a sequence encoding one or more of the insecticidal proteins derived from Bacillus thuringiensis.

The present invention further provides vectors comprising an isolated DNA of the invention. Such vectors can be used to express anthranilate synthase polypeptides in prokaryotic and eukaryotic cells, to transform plant cells and to generate transgenic plants.

The present invention also provides a transgenic plant comprising an isolated DNA of the invention. Expression of these isolated DNAs in the transgenic plant can result in an elevated level of L-tryptophan, preferably free L-tryptophan, in the transgenic plant, e.g., in the seeds or other parts of the plant. The level is increased above the level of L-tryptophan in the cells of a plant that differ from the cells of the transgenic plant by the absence of the DNA, e.g., the corresponding untransformed cells or an untransformed plant with the same genetic background. The DNA is preferably heritable in that it is preferably transmitted through a complete normal sexual cycle of the fertile plant to its progeny and to further generations.

Transgenic plants that can have such an isolated DNA include dicotyledonous plants (dicots), for example, soybean or canola. Alternatively, the transgenic plants can be monocotyledonous plants (monocots), for example, maize, rice, wheat, barley, millet, oat, or sorghum.

The present invention also provides a seed of any of the transgenic plants containing any of the isolated DNAs, anthranilate synthase polypeptides, transgenes or vectors of the invention.

The present invention further provides an animal feed or human food that contains at least a portion of a plant having an isolated DNA or DNA construct of the invention. Portions of plants that can be included in the animal feed or human food include, for example, seeds, leaves, stems, roots, tubers, or fruits. Desirable portions of plants have increased levels of tryptophan provided by expression of an anthranilate synthase encoded by an isolated DNA of the invention.

The present invention further provides a method for altering, preferably increasing, the tryptophan content of a plant (dicot or a monocot) by introducing an isolated DNA of the invention into regenerable cells of the plant. The DNA sequence is preferably operably linked to at least one promoter operable in the plant cells. The transformed cells are identified or selected, and then regenerated to yield a plant comprising cells that can express a functional anthranilate synthase polypeptide. In some embodiments, the DNA encoding the anthranilate synthase, or domain thereof, is a mutant DNA. In other embodiments, the DNA encoding the anthranilate synthase, or domain thereof, is an artificial polynucleotide. The introduced DNA is preferably heritable and the plant is preferably a fertile plant. For example, the introduced DNA preferably can be passed by a complete sexual cycle to progeny plants, and can impart the high tryptophan phenotype to subsequent generations of progeny.

The anthranilate synthase-encoding DNAs are preferably incorporated into vectors or “transgenes” that can also include DNA sequences encoding transit peptides, such as plastid transit peptides, and selectable marker or reporter genes, operably linked to one or more promoters that are functional in cells of the target plant. The promoter can be, for example, an inducible promoter, a tissue specific promoter, a strong promoter or a weak promoter. Other transcription or translation regulatory elements, e.g., enhancers or terminators, can also be functionally linked to the anthranilate synthase-encoding DNA segment.

Cells in suspension culture or as embryos, intact tissues or organs can be transformed by a wide variety of transformation techniques, for example, by microprojectile bombardment, electroporation and Agrobacterium tumefaciens-mediated transformation, and other procedures available to the art.

Thus, the cells of the transformed plant comprise a native anthranilate synthase gene and a transgene or other DNA segment encoding an exogenous anthranilate synthase. The expression of the exogenous anthranilate synthase in the cells of the plant can lead to increased levels of tryptophan and its secondary metabolites. In some embodiments, such expression confers tolerance to an amount of endogenous L-tryptophan analogue, for example, so that at least about 10% more anthranilate synthase activity is present than in a plant cell having a wild type or tryptophan-sensitive anthranilate synthase.

The present invention also provides a method for altering the tryptophan content in a plant comprising: (a) introducing into regenerable cells of a plant a transgene comprising an isolated DNA encoding an anthranilate synthase domain and a plastid transit peptide, operably linked to a promoter functional in the plant cell to yield transformed cells; and (b) regenerating a transformed plant from said transformed plant cells wherein the cells of the plant express the anthranilate synthase domain encoded by the isolated DNA in an amount effective to increase the tryptophan content in said plant relative to the tryptophan content in an untransformed plant of the same genetic background. The domain can be an anthranilate synthase α-domain. The anthranilate synthase domain can have one or more mutations, for example, mutations that reduce the sensitivity of the domain to tryptophan inhibition. Such mutations can be, for example, in the tryptophan-binding pocket. Such a domain can be, for example, an anthranilate synthase domain from Agrobacterium tumefaciens, Anabaena M22983, Arabidopsis thaliana, Azospirillum brasilense, Brucella melitensis, Escherichia coli, Euglena gracilis, Mesorhizobium loti, Nostoc sp. PCC7120, Rhizobium meliloti, Ruta graveolens, Rhodopseudomonas palustris, Salmonella typhimurium, Serratia marcescens, Sulfolobus solfataricus, soybean, rice, cotton or Zea mays. Ruta graveolens has its own chloroplast transport sequence that may be used with the anthranilate synthase transgene. Accordingly, one of skill in the art may not need to add a plastid transport sequence when using a Ruta graveolens DNA.

The present invention also provides novel isolated and purified DNA molecules comprising a DNA encoding a monomeric anthranilate synthase, or a domain thereof. Such an anthranilate synthase DNA can provide high levels of tryptophan when expressed within a plant. In some embodiments, the anthranilate synthase is substantially resistant to inhibition by free L-tryptophan or an analog thereof. Examples of novel DNA sequences contemplated by the invention include but are not limited to DNA molecules isolated from Agrobacterium tumefaciens, Anabaena M22983, Arabidopsis thaliana, Azospirillum brasilense, Bradyrhizobium japonicum, Brucella melitensis, Escherichia coli, Euglena gracilis, Mesorhizobium loti, Nostoc sp. PCC7120, Rhizobium meliloti, Ruta graveolens, Rhodopseudomonas palustris, Rhodospirillum rubrum, Salmonella typhimurium, Serratia marcescens, Sorghum bicolor, Sulfolobus solfataricus, Thermobifida fusca, or Zea mays (maize), or other such anthranilate synthases.

These DNA sequences include synthetic or naturally-occurring monomeric forms of anthranilate synthase that have the α-domain of anthranilate synthase linked to at least one other anthranilate synthase domain on a single polypeptide chain. The monomeric anthranilate synthase can, for example, be a fusion of an anthranilate synthase α or β domain. Such an anthranilate synthase α or β domain can be derived from Agrobacterium tumefaciens, Anabaena M22983, Arabidopsis thaliana, Azospirillum brasilense, Bradyrhizobium japonicum, Brucella melitensis, Escherichia coli, Euglena gracilis, Mesorhizobium loti, Nostoc sp. PCC7120, Rhizobium meliloti, Ruta graveolens, Rhodopseudomonas palustris, Rhodospirillum rubrum, Salmonella typhimurium, Serratia marcescens, Sorghum bicolor, Sulfolobus solfataricus, Thermobifida fusca, sorghum, soybean, rice, cotton, wheat, tobacco, or Zea mays (maize) or any gene encoding a subunit or domain of anthranilate synthase. Such anthranilate synthases and domains thereof are also exemplified herein by the anthranilate synthase nucleic acids isolated from Agrobacterium tumefaciens (SEQ ID NOs: 1, 75, or 84-94), Zea mays (SEQ ID NOs: 2, 67, 68, 96, 116, or 136), Ruta graveolens (SEQ ID NO: 3), Anabaena M22983, Arabidopsis thaliana (SEQ ID NO: 45), Azospirillum brasilense (SEQ ID NO: 122), Brucella melitensis (SEQ ID NO: 123), Mesorhizobium loti (SEQ ID NO: 121), Nostoc sp. PCC7120 (SEQ ID NOs: 124 or 125), Rhizobium meliloti, Rhodopseudomonas palustris (SEQ ID NO: 126), Sulfolobus solfataricus, rice (SEQ ID NOs: 94, 95, 119, or 120), wheat (SEQ ID NO: 97), tobacco (SEQ ID NO: 98), Gossypium hirsutum (SEQ ID NOs: 104 or 105), Glycine max (SEQ ID NOs: 106, 107, 112, or 113), Bradyrhizobium japonicum (SEQ ID NO: 127), Rhodospirillum rubrum (SEQ ID NO: 128), Thermobifida fusca (SEQ ID NO: 129) or Sorghum bicolor (SEQ ID NOs: 134 or 135). These nucleotide sequences encode anthranilate synthases or α-domains or β-domains thereof from Agrobacterium tumefaciens (SEQ ID NOs: 4, 58-65, 69, or 70), Zea mays (SEQ ID NOs: 5, 66, 101, 118, or 137), Ruta graveolens (SEQ ID NO: 6), Anabaena M22983, Azospirillum brasilense (SEQ ID NO: 78), Brucella melitensis (SEQ ID NO: 79), Mesorhizobium loti (SEQ ID NO: 77), Nostoc sp. PCC7120 (SEQ ID NOs: 80 or 81), Rhizobium meliloti (SEQ ID NOs: 7 or 43), Rhodopseudomonas palustris (SEQ ID NOs: 57 or 82), Sulfolobus solfataricus (SEQ ID NOs: 8 or 44), rice (SEQ ID NOs: 99, 100, or 117), wheat (SEQ ID NO: 102), tobacco (SEQ ID NO: 103), Gossypium hirsutum (SEQ ID NOs: 108 or 109), Glycine max (SEQ ID NOs: 110 or 111), Bradyrhizobium japonicum (SEQ ID NO: 130), Rhodospirillum rubrum (SEQ ID NO: 131), Thermobifida fusca (SEQ ID NO: 132) or Sorghum bicolor (SEQ ID NO: 133).

The present invention also provides an isolated DNA molecule comprising a DNA sequence encoding an Agrobacterium tumefaciens anthranilate synthase or a domain thereof having enzymatic activity. Such a DNA molecule can encode an anthranilate synthase having SEQ ID NOs: 4, 58-65, 69, or 70, a domain or variant thereof having anthranilate synthase activity. The DNA molecule can also have a sequence comprising SEQ ID NOs: 1, 75, 84-93 or 144-145, or a domain or variant thereof. Coding regions of any DNA molecule provided herein can also be optimized for expression in a selected organism, for example, a selected plant or microbe. Examples of DNA molecules having optimized codon usage for a selected plant are Agrobacterium tumefaciens anthranilate synthase-encoding DNA molecules having SEQ ID NOs: 75, 144 and 145.
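The idea of optimizing a coding region for a selected organism can be illustrated by re-encoding a protein sequence with one preferred codon per amino acid. The Python sketch below is a deliberately simplified example under stated assumptions: the function name is hypothetical, and the PREFERRED_CODON table is a small placeholder containing valid standard-code codons chosen for demonstration, not measured codon usage for maize or any other plant. It does not represent how SEQ ID NOs: 75, 144, or 145 were designed, and practical codon optimization typically weighs additional factors.

```python
# Placeholder preference table: each entry is a valid codon for that residue
# in the standard genetic code, but the choices are illustrative only.
PREFERRED_CODON = {
    "M": "ATG",
    "W": "TGG",
    "V": "GTG",
    "S": "AGC",
    "F": "TTC",
    "N": "AAC",
    "P": "CCG",
    "A": "GCC",
    "G": "GGC",
}


def back_translate(protein, table=PREFERRED_CODON, stop="TGA"):
    """Encode a protein with one chosen codon per residue and append a stop codon."""
    codons = []
    for residue in protein:
        if residue not in table:
            raise ValueError(f"no codon listed for residue {residue!r}")
        codons.append(table[residue])
    return "".join(codons) + stop


coding_region = back_translate("MVSF")
print(coding_region)  # ATGGTGAGCTTCTGA
```

Because only the codon choice changes, the encoded amino acid sequence is unchanged, which is why a codon-optimized DNA molecule and its unoptimized counterpart can encode the same anthranilate synthase polypeptide.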

The present invention also provides an isolated and purified DNA molecule comprising a DNA sequence encoding a Zea mays anthranilate synthase domain. Such a DNA molecule can encode an anthranilate synthase domain having SEQ ID NOs: 5 or 66, or a variant or derivative thereof having anthranilate synthase activity. The DNA molecule can also have a sequence comprising SEQ ID NOs: 2, 67, or 68, or a domain or variant thereof.

The present invention further provides an isolated DNA molecule of at least 8 nucleotides that hybridizes to the complement of a DNA molecule comprising any one of SEQ ID NOs: 1, 75, or 84-94 under stringent conditions. Such a DNA molecule can be a probe or a primer, for example, a nucleic acid having any one of SEQ ID NOs: 9-42, 47-56, or 138-143. Alternatively, the DNA can include up to an entire coding region for a selected anthranilate synthase, or a domain thereof. Such a DNA can also include a DNA sequence encoding a promoter operable in plant cells and/or a DNA sequence encoding a plastid transit peptide. The present invention further contemplates vectors for transformation and expression of these types of DNA molecules in plants and/or microbes.

Functional anthranilate synthase DNA sequences and functional anthranilate synthase polypeptides that exhibit 50%, preferably 60%, more preferably 70%, even more preferably 84%, most preferably 90%, e.g., 95% to 99%, sequence identity to the DNA sequences and amino acid sequences explicitly described herein are also within the scope of the present invention. For example, 85% identity means that 85% of the amino acids are identical when the 2 sequences are aligned for maximum matching. Gaps (in either of the 2 sequences being matched) are allowed in maximizing matching; gap lengths of 5 or less are preferred with 2 or less being more preferred.
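The percent identity measure described above can be computed directly from a pairwise alignment. The following Python sketch is one possible reading, offered under stated assumptions: the alignment is supplied as two equal-length strings with "-" marking gaps, identity is counted over columns in which both sequences have a residue (the text does not specify the denominator, so this choice is an assumption), and the function names and toy sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent of identical residues over columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == "-" or b == "-":
            continue  # gapped columns are skipped under this assumption
        compared += 1
        if a == b:
            identical += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * identical / compared


def longest_gap(aligned):
    """Length of the longest run of gap characters in one aligned sequence."""
    longest = current = 0
    for char in aligned:
        current = current + 1 if char == "-" else 0
        longest = max(longest, current)
    return longest


# Toy alignment (hypothetical): 7 ungapped columns, 6 of them identical.
seq_a = "MKT-VLSA"
seq_b = "MKTQVLTA"

print(round(percent_identity(seq_a, seq_b), 1))  # 85.7
print(longest_gap(seq_a), longest_gap(seq_b))    # 1 0
```

The longest_gap helper corresponds to the stated preference for gap lengths of 5 or less, with 2 or less being more preferred.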

Alternatively and preferably, two polypeptide sequences are homologous, as this term is used herein, if they have an alignment score of more than 5 (in standard deviation units) using the program ALIGN with the mutation data matrix and a gap penalty of 6 or greater. See Dayhoff (1972). The 2 sequences or parts thereof are more preferably homologous if their amino acids are greater than or equal to 50% identical when optimally aligned using the ALIGN program.

The present invention further provides expression vectors for generating a transgenic plant with high seed levels of tryptophan comprising an isolated DNA sequence encoding a monomeric anthranilate synthase comprising an anthranilate synthase α-domain linked to an anthranilate synthase β-domain and a plastid transit peptide, operably linked to a promoter functional in a plant cell. Such a monomeric anthranilate synthase can, for example, be an Agrobacterium tumefaciens, Rhizobium meliloti, Mesorhizobium loti, Brucella melitensis, Nostoc sp. PCC7120, Azospirillum brasilense, Anabaena M22983, Bradyrhizobium japonicum, Rhodospirillum rubrum, or Thermobifida fusca anthranilate synthase. The monomeric anthranilate synthase can also be a fusion of anthranilate synthase α- and β-domains derived from Agrobacterium tumefaciens, Anabaena M22983, Arabidopsis thaliana, Azospirillum brasilense, Brucella melitensis, Mesorhizobium loti, Nostoc sp. PCC7120, Rhizobium meliloti, Rhodopseudomonas palustris, Ruta graveolens, Sulfolobus solfataricus, Salmonella typhimurium, Serratia marcescens, Bradyrhizobium japonicum, Rhodospirillum rubrum, Thermobifida fusca, Sorghum bicolor, soybean, rice, cotton, wheat, tobacco, Zea mays, or any gene encoding a subunit or domain of anthranilate synthase.
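The homology criterion stated above, an alignment score expressed in standard deviation units, can be illustrated by comparing the score of a real alignment with the scores obtained after shuffling one sequence. The Python sketch below is not the ALIGN program and does not use the mutation data matrix: it substitutes a simple global alignment with match, mismatch, and linear gap scores, all of which are assumptions made for demonstration. The function names, parameter values, shuffle count, and toy peptide are likewise hypothetical, so the numbers it yields are not comparable to the threshold of 5 recited in the text.

```python
import random
import statistics


def global_score(a, b, match=1, mismatch=-1, gap=-2):
    """Score of the best global alignment (Needleman-Wunsch, linear gap penalty)."""
    previous = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        current = [i * gap]
        for j in range(1, len(b) + 1):
            diagonal = previous[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            current.append(max(diagonal, previous[j] + gap, current[j - 1] + gap))
        previous = current
    return previous[-1]


def alignment_z_score(a, b, shuffles=200, seed=0):
    """Real score minus mean shuffled score, divided by the shuffled standard deviation."""
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    real = global_score(a, b)
    letters = list(b)
    shuffled_scores = []
    for _ in range(shuffles):
        rng.shuffle(letters)  # shuffling keeps composition but destroys order
        shuffled_scores.append(global_score(a, "".join(letters)))
    spread = statistics.pstdev(shuffled_scores)
    if spread == 0:
        raise ValueError("shuffled scores have no spread")
    return (real - statistics.mean(shuffled_scores)) / spread


toy_peptide = "MKTAYIAKQRQISFVKSHFSRQ"  # hypothetical sequence
z = alignment_z_score(toy_peptide, toy_peptide)
print(z > 0)  # an identical pair scores above its shuffled background
```

Expressing the score relative to a shuffled background is what makes the measure a statement about sequence order rather than amino acid composition alone.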

The transmission of the isolated and purified anthranilate synthase DNA providing increased levels of tryptophan can be evaluated at a molecular level, e.g., Southern or Northern blot analysis, PCR-based methodologies, the biochemical or immunological detection of anthranilate synthase, or by phenotypic analyses, i.e., whether cells of the transformed progeny can grow in the presence of an amount of an amino acid analog of tryptophan that inhibits the growth of untransformed plant cells.

The present invention also provides a method of producing anthranilate synthase in a prokaryotic or eukaryotic host cell, such as a yeast, insect cell, or bacterium, which can be cultured, preferably on a commercial scale. The method includes the steps of introducing a transgene comprising a DNA segment encoding an anthranilate synthase, or a domain thereof, such as a monomeric anthranilate synthase, comprising at least the α and β anthranilate synthase domains, or functional variant thereof, into a host cell and expressing anthranilate synthase in the host cell so as to yield functional anthranilate synthase or domain thereof. A transgene generally includes transcription and translation regulatory elements, e.g., a promoter, functional in the host cell, either of eukaryotic or prokaryotic origin. Preferably, the transgene is introduced into a prokaryotic cell, such as Escherichia coli, or a eukaryotic cell, such as a yeast or insect cell, that is known to be useful for production of recombinant proteins. Culturing the transformed cells can lead to enhanced production of tryptophan and its derivatives, which can be recovered from the cells or from the culture media. Accumulation of tryptophan may also lead to the increased production of secondary metabolites in microbes and plants, for example, indole-containing metabolites such as simple indoles, indole conjugates, indole alkaloids, indole phytoalexins and indole glucosinolates in plants.

Anthranilate synthases insensitive to tryptophan have the potential to increase a variety of chorismate-derived metabolites, including those derived from phenylalanine due to the stimulation of phenylalanine synthesis by tryptophan via chorismate mutase (Siehl, 1999). Other chorismate-derived metabolites that may increase when feedback insensitive anthranilate synthases are present include phenylpropanoids, flavonoids, and isoflavonoids, as well as those derived from anthranilate, such as indole, indole alkaloids, and indole glucosinolates. Many of these compounds are important plant hormones, plant defense compounds, chemopreventive agents of various health conditions, and/or pharmacologically active compounds. The range of these compounds whose synthesis might be increased by expression of anthranilate synthase depends on the organism in which the anthranilate synthase is expressed. The present invention contemplates synthesis of tryptophan and other useful compounds in a variety of prokaryotic and eukaryotic cells or organisms, including plant cells, microbes, fungi, yeast, bacteria, insect cells, and mammalian cells.

Hence, the present invention provides a method for producing tryptophan comprising: culturing a prokaryotic or eukaryotic host cell comprising an isolated DNA under conditions sufficient to express a monomeric anthranilate synthase encoded by the isolated DNA, wherein the monomeric anthranilate synthase comprises an anthranilate synthase α domain and an anthranilate synthase β domain, and wherein the conditions sufficient to express a monomeric anthranilate synthase comprise nutrients and precursors sufficient for the host cell to synthesize tryptophan utilizing the monomeric anthranilate synthase.

Examples of useful compounds that may be generated upon expression in a variety of host cells and/or organisms include indole acetic acid and other auxins, isoflavonoid compounds important to cardiovascular health found in soy, volatile indole compounds which act as signals to natural enemies of herbivorous insects in maize, anticarcinogens such as indole glucosinolates (indole-3-carbinol) found in the Cruciferae plant family, as well as indole alkaloids such as ergot compounds produced by certain species of fungi (Barnes et al., 1996; Frey et al., 2000; Muller et al., 2000; Mantegani et al., 1999; Zeligs, 1998; Mash et al., 1998; Melanson et al., 1997; Broadbent et al., 1998).

The present invention also provides an isolated and purified DNA molecule of at least seven nucleotide bases that hybridizes under moderate, and preferably, high stringency conditions to the complement of an anthranilate synthase encoding DNA molecule. Such isolated and purified DNA molecules comprise novel DNA segments encoding anthranilate synthase or a domain or mutant thereof. The mutant DNA can encode an anthranilate synthase that is substantially resistant to inhibition by free L-tryptophan or an amino acid analog of tryptophan. Such anthranilate synthase DNA molecules can hybridize, for example, to an Agrobacterium tumefaciens, Rhodopseudomonas palustris or Ruta graveolens anthranilate synthase, or an α-domain thereof, including functional mutants thereof. When these DNA molecules encode a functional anthranilate synthase or an anthranilate synthase domain, they are termed “variants” of the primary DNA molecules encoding anthranilate synthase, anthranilate synthase domains or mutants thereof. Shorter DNA molecules or oligonucleotides can be employed as primers for amplification of target DNA sequences by PCR, or as intermediates in the synthesis of full-length genes.

Also provided is a hybridization probe comprising a novel isolated and purified DNA segment of at least seven nucleotide bases, which is detectably labeled or which can bind to a detectable label, which DNA segment hybridizes under moderate or, preferably, high stringency conditions to the non-coding strand of a DNA molecule comprising a DNA segment encoding an anthranilate synthase such as a monomeric anthranilate synthase, or a domain thereof, such as the α-domain, including functional mutants thereof, that are substantially resistant to inhibition by an amino acid analog of tryptophan. Moderate and stringent hybridization conditions are well known to the art, see, for example, sections 9.47-9.51 of Sambrook et al., (1989); see, also, Sambrook and Russell, (2001). For example, stringent conditions are those that (1) employ low ionic strength and high temperature for washing, for example, 0.015 M NaCl/0.0015 M sodium citrate (SSC); 0.1% sodium lauryl sulfate (SDS) at 50° C., or (2) employ a denaturing agent such as formamide during hybridization, e.g., 50% formamide with 0.1% bovine serum albumin/0.1% Ficoll/0.1% polyvinylpyrrolidone/50 mM sodium phosphate buffer at pH 6.5 with 750 mM NaCl, 75 mM sodium citrate at 42° C. Another example is use of 50% formamide, 5×SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5×Denhardt's solution, sonicated salmon sperm DNA (50 μg/ml), 0.1% sodium dodecylsulfate (SDS), and 10% dextran sulfate at 42° C., with washes at 42° C. in 0.2×SSC and 0.1% SDS.
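The salt concentrations quoted above are related to one another by simple scaling. As a worked check, the following Python sketch assumes, from the 5×SSC figures given in this paragraph (0.75 M NaCl, 0.075 M sodium citrate), that 1×SSC corresponds to 0.15 M NaCl and 0.015 M sodium citrate; the function name `ssc_molarity` is illustrative only.

```python
from typing import Tuple

# Derived from the 5x SSC composition stated in the text (assumption: linear scaling).
SSC_1X_NACL_M = 0.75 / 5       # about 0.15 M NaCl at 1x
SSC_1X_CITRATE_M = 0.075 / 5   # about 0.015 M sodium citrate at 1x


def ssc_molarity(fold: float) -> Tuple[float, float]:
    """Return (NaCl molarity, sodium citrate molarity) for a given fold of SSC."""
    if fold < 0:
        raise ValueError("fold concentration must be non-negative")
    return fold * SSC_1X_NACL_M, fold * SSC_1X_CITRATE_M


if __name__ == "__main__":
    for fold in (0.1, 0.2, 5.0):
        nacl, citrate = ssc_molarity(fold)
        print(f"{fold}x SSC: {nacl:.4f} M NaCl, {citrate:.4f} M sodium citrate")
```

On this scaling, the low ionic strength wash of 0.015 M NaCl/0.0015 M sodium citrate corresponds to 0.1×SSC, and the 750 mM NaCl/75 mM sodium citrate hybridization buffer corresponds to 5×SSC.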

The present invention also provides methods and compositions for providing a polynucleotide molecule that encodes a tryptophan feedback insensitive anthranilate synthase comprising an artificial polynucleotide molecule substantially identical to SEQ ID NO: 145; wherein said artificial polynucleotide molecule provides enhanced levels of at least one amino acid in a transgenic corn cell comprising said artificial polynucleotide molecule when compared to a corn cell not comprising said artificial polynucleotide molecule.

The artificial polynucleotide molecules of the present invention encode proteins that provide nutritionally enhanced phenotypes to a transgenic cell containing a DNA construct comprising said artificial polynucleotide molecules. The nutritionally enhanced phenotypes include, but are not limited to, the production of elevated levels of tryptophan in the cell when compared to tryptophan levels seen in cells not comprising said artificial polynucleotide molecules.

The present invention further provides a method of providing an increased level of tryptophan in the seed of a plant comprising: a) constructing an artificial polynucleotide molecule; b) constructing a DNA construct containing said artificial polynucleotide molecule; c) transforming said DNA construct into a plant cell; and d) regenerating said plant cell into a fertile transgenic plant.

The present invention further provides nutritionally enhanced feed products and methods of making such products comprising seed of transgenic corn cells with an increased level of tryptophan; and processing such seed into a meal, protein or oil.

DESCRIPTION OF THE FIGURES

FIG. 1 depicts the translated sequence of the Agrobacterium tumefaciens anthranilate synthase DNA sequence (upper sequence) (SEQ ID NO: 4) and the translated sequence of the anthranilate synthase DNA sequence from Rhizobium meliloti (lower sequence) (SEQ ID NO: 7).

FIG. 2 is a restriction map of plasmid pMON34705.

FIG. 3A-B depicts an anthranilate synthase amino acid sequence alignment comparing the Agrobacterium tumefaciens α-domain sequence (SEQ ID NO: 4) and the Sulfolobus solfataricus α-domain sequence (SEQ ID NO: 8).

FIG. 4A-B depicts the sequences of the 34 primers (SEQ ID NOs: 9-42) used to mutate SEQ ID NO: 1. The mutated codons are underlined and the changed bases are in lower case.

FIG. 5 depicts a restriction map of plasmid pMON58046.

FIG. 6 depicts a restriction map of plasmid pMON58043.

FIG. 7A-D depicts a multiple sequence alignment of monomeric “TrpEG” anthranilate synthases having SEQ ID NOs: 4 and 43 (derived from Agrobacterium tumefaciens and Rhizobium meliloti, respectively) with the TrpE (α) and TrpG (β) domains of heterotetrameric anthranilate synthases from Sulfolobus solfataricus (SEQ ID NO: 44) and Arabidopsis thaliana (SEQ ID NO: 45). Linker regions are underlined.

FIG. 8 is a restriction map of plasmid pMON39325.

FIG. 9 is a graph depicting free tryptophan levels in soybean seeds transformed with pMON39325. There were five observations from each event. NT represents non-transgenic soybean seed.

FIG. 10 depicts the sequence of the truncated trpE gene of Escherichia coli EMG2 (K-12 wt F+) (SEQ ID NO: 46). The first 30 bp and the last 150 bp of this trpE nucleic acid are connected by an EcoRI restriction site. The beginning of the trpG gene follows the trpE stop codon.

FIG. 11 schematically depicts construction of the in-frame deletion in the E. coli trpE gene.

FIG. 12A-C depicts the DNA (SEQ ID NO: 1) and amino acid (SEQ ID NO: 4) sequences of the α-domain of the anthranilate synthase gene isolated from Agrobacterium tumefaciens.

FIG. 13A-C depicts the DNA (SEQ ID NO: 2) sequence of the α-domain of the anthranilate synthase gene isolated from Zea mays. FIG. 13D depicts the amino acid (SEQ ID NO: 5) sequence of the α-domain of the anthranilate synthase gene isolated from Zea mays.

FIG. 14 is a restriction map of plasmid pMON58120.

FIG. 15A-E provides a sequence comparison of anthranilate synthase amino acid sequences from Agrobacterium tumefaciens (AgrTu_15889565) (SEQ ID NO: 4), Rhizobium meliloti (RhiMe_136328) (SEQ ID NO: 7), Mesorhizobium loti (MesLo_13472468) (SEQ ID NO: 77), Azospirillum brasilense (AzoBr_1717765) (SEQ ID NO: 78), Brucella melitensis (BruMe_17986732) (SEQ ID NO: 79), Nostoc sp. (Nostoc_17227910) (SEQ ID NO: 80), Nostoc sp. (Nostoc_17230725) (SEQ ID NO: 81), and Rhodopseudomonas palustris (RhoPa_TrpEG) (SEQ ID NO: 82).

FIG. 16A-B provides an optimized nucleotide sequence for Agrobacterium tumefaciens anthranilate synthase (SEQ ID NO: 75).

FIG. 17A-C provides an alignment of the wild type (top strand) and optimized (bottom strand) Agrobacterium tumefaciens anthranilate synthase nucleotide sequences (SEQ ID NOs: 1 and 75). These two sequences are 94% identical, as demonstrated by the middle strand.

FIG. 18 is a restriction map of plasmid pMON66877.

FIG. 19 is a restriction map of plasmid pMON66878.

FIG. 20 is a restriction map of plasmid pMON66879.

FIG. 21 is a restriction map of plasmid pMON66595.

FIG. 22 A-B provides a codon-optimized nucleotide sequence for the mutant S51F allele of Agrobacterium tumefaciens anthranilate synthase (SEQ ID NO: 144).

FIG. 23 A-C provides an alignment of the non-optimized (top strand; AS-S51F; SEQ ID NO: 86) and codon-optimized (bottom strand; AS (S51F) Codon_Opt; SEQ ID NO: 144) nucleotide sequences for the mutant S51F allele of Agrobacterium tumefaciens anthranilate synthase (AgroAS). The alignment was carried out using the ClustalW (Slow/Accurate, IUB) program. These two sequences are 82.9% identical and have no polynucleotide sequence lengths of more than 29 consecutive nucleotides that have 100% identity.

FIG. 24 A-B provides an alignment of the amino acid sequences of the non-optimized S51F mutant Agrobacterium tumefaciens anthranilate synthase (top strand; AgroAS-S51F; SEQ ID NO: 60) with the amino acid translation of the codon-optimized S51F mutant Agrobacterium tumefaciens anthranilate synthase (bottom strand; AS (S51F) Codon_Opt-Final; translation of SEQ ID NO: 144). The alignment was carried out using the ClustalW (Slow/Accurate, IUB) program. These two sequences are 100% identical.

FIG. 25 A-B provides a codon-optimized nucleotide sequence for the mutant S51C allele of Agrobacterium tumefaciens anthranilate synthase (AgroAS-S51C-nno (non-native optimized gene); pMON68066; SEQ ID NO: 145).

FIG. 26 provides the amino acid translation of SEQ ID NO: 144, the codon-optimized mutant S51F allele of Agrobacterium tumefaciens anthranilate synthase (same as SEQ ID NO: 60).

FIG. 27 is a restriction map of plasmid pMON69770.

FIG. 28 is a restriction map of plasmid pMON68065.

FIG. 29 is a restriction map of plasmid pMON68066.

FIG. 30 is a graphic comparison of free tryptophan levels in kernels from non-transgenic corn plants vs. transgenic corn plants containing the plasmids pMON69770, pMON68065 and pMON68066.

FIG. 31 shows the tryptophan synthetic pathway.

DETAILED DESCRIPTION OF THE INVENTION

The present invention provides isolated DNAs, vectors, host cells and transgenic plants comprising an isolated nucleic acid encoding an anthranilate synthase capable of providing high levels of tryptophan upon expression within the plant. In one embodiment, the isolated nucleic acid encodes a monomeric anthranilate synthase (AS). In other embodiments, the isolated nucleic acid encodes an anthranilate synthase, or a domain thereof, that is substantially resistant to inhibition by free L-tryptophan or an amino acid analog of tryptophan. Expression of the anthranilate synthase, or domain thereof, elevates the level of tryptophan, e.g., free tryptophan in the seed, over the level present in the plant absent such expression.

Methods are also provided for producing transgenic plants having nucleic acids associated with increased anthranilate synthase activity, and producing cultured cells, plant tissues, plants, plant parts and seeds that produce high levels of tryptophan. Such transgenic plants can preferably sexually transmit the ability to produce high levels of tryptophan to their progeny. Also described are methods for producing isolated DNAs encoding mutant anthranilate synthases, and cell culture selection techniques to select for novel genotypes that overproduce tryptophan and/or are resistant to tryptophan analogs. For example, to produce soybean lines capable of producing high levels of tryptophan, transgenic soybean cells that contain at least one of the isolated DNAs of the present invention are prepared and characterized, then regenerated into plants. Some of the isolated DNAs confer resistance to growth inhibition by a tryptophan analog. The methods provided in the present invention may also be used to produce increased levels of free tryptophan in dicot plants, such as other legumes, as well as in monocots, such as the cereal grains.

DEFINITIONS

As used herein, “altered” levels of tryptophan in a transformed plant, plant tissue, plant part or plant cell are levels which are greater or lesser than the levels found in the corresponding untransformed plant, plant tissue, plant part or plant cell.

As used herein, an “α-domain” is a portion of an enzyme or enzymatic complex that binds chorismate and eliminates the enolpyruvate side chain. Such an α-domain can be encoded by a TrpE gene. In some instances, the α-domain is a single polypeptide that functions only to bind chorismate and to eliminate the enolpyruvate side chain from chorismate. In other instances, the α-domain is part of a larger polypeptide that can carry out other enzymatic functions in addition to binding chorismate and eliminating the enolpyruvate side chain from chorismate.

The term “β-domain” refers to a portion of an enzyme or enzymatic complex that transfers an amino group from glutamine to the position on the chorismate ring that resides between the carboxylate and the enolpyruvate moieties. Such a β-domain can be encoded by a TrpG gene. In some instances, the β-domain is a single polypeptide that functions only to transfer an amino group from glutamine to the position on the chorismate ring that resides between the carboxylate and the enolpyruvate moieties. In other instances, the β-domain is part of a larger polypeptide that can carry out other enzymatic functions in addition to transferring an amino group from glutamine to the position on the chorismate ring that resides between the carboxylate and the enolpyruvate moieties.

As used herein, “an amino acid analog of tryptophan” is an amino acid that is structurally related to tryptophan and that can bind to the tryptophan-binding site in a wild type anthranilate synthase. These analogs include, but are not limited to, 6-methylanthranilate, 5-methyltryptophan, 4-methyltryptophan, 5-fluorotryptophan, 5-hydroxytryptophan, 7-azatryptophan, 3′-indoleacrylic acid, 3-methylanthranilic acid, and the like.

The phrase “consists essentially of” as used with respect to the present DNA molecules, sequences or segments is defined to mean that a major portion of the DNA molecule, sequence or segment encodes an anthranilate synthase. Unless otherwise indicated, the DNA molecule, sequence or segment generally does not encode proteins other than an anthranilate synthase.

The term “complementary to” is used herein to mean that the sequence of a nucleic acid strand could hybridize to all, or a portion, of a reference polynucleotide sequence. For illustration, the nucleotide sequence “TATAC” has 100% identity to a reference sequence 5′-TATAC-3′ but is 100% complementary to a reference sequence 5′-GTATA-3′.
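The distinction between identity and complementarity in the illustration above can be checked with a short Python sketch. It assumes DNA sequences written 5′ to 3′ using only the letters A, C, G and T; the function names are hypothetical.

```python
# Translation table mapping each DNA base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def is_fully_complementary(query: str, reference: str) -> bool:
    """True if query would pair with reference along its entire length."""
    return reverse_complement(query) == reference.upper()


if __name__ == "__main__":
    print(reverse_complement("TATAC"))               # GTATA
    print(is_fully_complementary("TATAC", "GTATA"))  # True
    print(is_fully_complementary("TATAC", "TATAC"))  # False: identical, not complementary
```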

As used herein, an “exogenous” anthranilate synthase is an anthranilate synthase that is encoded by an isolated DNA that has been introduced into a host cell, and that is preferably not identical to any DNA sequence present in the cell in its native, untransformed state. An “endogenous” or “native” anthranilate synthase is an anthranilate synthase that is naturally present in a host cell or organism.

As used herein, “increased,” “high” or “elevated” levels of free L-tryptophan in a plant cell, plant tissue, plant part or plant are levels that are about 2 to 200 times, preferably about 5 to 150 times, and more preferably about 10-100 times, the levels found in an untransformed plant cell, plant tissue, plant part or plant, i.e., one where the genome has not been altered by the presence of an exogenous anthranilate synthase nucleic acid or domain thereof. For example, the levels of free L-tryptophan in a transformed plant seed are compared with those in an untransformed plant seed (“the starting material”).

DNA molecules encoding an anthranilate synthase, and DNA molecules encoding a transit peptide or marker/reporter gene are “isolated” in that they were taken from their natural source and are no longer within the cell where they normally exist. Such isolated DNA molecules may have been at least partially prepared or manipulated in vitro, e.g., isolated from a cell in which they are normally found, purified, and amplified. Such isolated DNA molecules can also be “recombinant” in that they have been combined with exogenous DNA molecules or segments. For example, a recombinant DNA can be an isolated DNA that is operably linked to an exogenous promoter, or to a promoter that is endogenous to the host cell. The term embraces nucleic acids that are biochemically purified so as to substantially remove contaminating nucleic acids and other cellular components. The term also embraces recombinant nucleic acids and chemically synthesized nucleic acids.

As used herein with respect to anthranilate synthase, the term “monomeric” means that two or more anthranilate synthase domains are incorporated in a functional manner into a single polypeptide chain. The monomeric anthranilate synthase may be assembled in vivo into a dimeric form. Monomeric anthranilate synthase nucleic acids and polypeptides can be isolated from various organisms such as Agrobacterium tumefaciens, Anabaena M22983, Azospirillum brasilense, Brucella melitensis, Euglena gracilis, Mesorhizobium loti, Nostoc sp. PCC7120 or Rhizobium meliloti. Alternatively, monomeric anthranilate synthase nucleic acids and polypeptides can be constructed from a combination of domains selected from any convenient monomeric or multimeric anthranilate synthase gene. Such organisms include, for example, Agrobacterium tumefaciens, Anabaena M22983, Arabidopsis thaliana, Azospirillum brasilense, Brucella melitensis, Mesorhizobium loti, Nostoc sp. PCC7120, Rhizobium meliloti, Rhodopseudomonas palustris, Ruta graveolens, Sulfolobus solfataricus, Salmonella typhimurium, Serratia marcescens, soybean, rice, cotton, Zea mays, or any gene encoding a subunit or domain of anthranilate synthase. Nucleic acids encoding the selected domains can be linked recombinantly. For example, a nucleic acid encoding the C-terminus of an α-domain can be linked to a nucleic acid encoding the N-terminus of the β-domain, or vice versa, by forming a phosphodiester bond. As an alternative, such single domain polypeptides can be linked chemically. For example, the α-domain can be linked via its C-terminus to the N-terminus of the β-domain, or vice versa, by forming a peptide bond.

As used herein, a “native” gene means a gene that has not been changed in vitro, i.e., a “wild-type” gene that has not been mutated in vitro.

The term “plastid” refers to the class of plant cell organelles that includes amyloplasts, chloroplasts, chromoplasts, elaioplasts, eoplasts, etioplasts, leucoplasts, and proplastids. These organelles are self-replicating, and contain what is commonly referred to as a “chloroplast genome,” a circular DNA molecule that ranges in size from about 120 to about 217 kb, depending upon the plant species, and which usually contains an inverted repeat region.

As used herein, “polypeptide” means a continuous chain of amino acids that are all linked together by peptide bonds, except for the N-terminal and C-terminal amino acids that have amino and carboxylate groups, respectively, and that are not linked in peptide bonds. Polypeptides can have any length and can be post-translationally modified, for example, by glycosylation or phosphorylation.

As used herein, a plant cell, plant tissue or plant that is “resistant or tolerant to inhibition by an amino acid analog of tryptophan” is a plant cell, plant tissue, or plant that retains at least about 10% more anthranilate synthase activity in the presence of an analog of L-tryptophan, than a corresponding wild type anthranilate synthase. In general, a plant cell, plant tissue, or plant that is “resistant or tolerant to inhibition by an amino acid analog of tryptophan” can grow in an amount of an amino acid analog of tryptophan that normally inhibits growth of the untransformed plant cell, plant tissue, or plant, as determined by methodologies known to the art. For example, a homozygous backcross converted inbred plant transformed with a DNA molecule that encodes an anthranilate synthase that is substantially resistant or tolerant to inhibition by an amino acid analog of tryptophan grows in an amount of an amino acid analog of tryptophan that inhibits the growth of the corresponding, i.e., substantially isogenic, recurrent inbred plant.

As used herein, an anthranilate synthase that is “resistant or tolerant to inhibition by tryptophan or an amino acid analog of tryptophan” is an anthranilate synthase that retains greater than about 10% more activity than a corresponding wild-type or native susceptible anthranilate synthase, when the tolerant/resistant and wild type anthranilate synthases are exposed to equivalent amounts of tryptophan or an amino acid analog of tryptophan. Preferably the resistant or tolerant anthranilate synthase retains greater than about 20% more activity than a corresponding wild-type or native susceptible anthranilate synthase.

As used herein with respect to anthranilate synthase, the phrase “a domain thereof,” includes a structural or functional segment of a full-length anthranilate synthase. A structural domain includes an identifiable structure within the anthranilate synthase. An example of a structural domain includes an alpha helix, a beta sheet, an active site, a substrate or inhibitor binding site and the like. A functional domain includes a segment of an anthranilate synthase that performs an identifiable function such as a tryptophan binding pocket, an active site or a substrate or inhibitor binding site. Functional domains of anthranilate synthase include those portions of anthranilate synthase that can catalyze one step in the biosynthetic pathway of tryptophan. For example, an α-domain is a domain that can be encoded by trpE and that can transfer NH₃ to chorismate and form anthranilate. A β-domain can be encoded by trpG and can remove an amino group from glutamine to form ammonia. Hence, a functional domain includes enzymatically active fragments and domains of an anthranilate synthase. Mutant domains of anthranilate synthase are also contemplated. Wild type anthranilate synthase nucleic acids utilized to make mutant domains include, for example, any nucleic acid encoding a domain of Agrobacterium tumefaciens, Anabaena M22983, Arabidopsis thaliana, Azospirillum brasilense, Brucella melitensis, Mesorhizobium loti, Nostoc sp. PCC7120, Rhizobium meliloti, Rhodopseudomonas palustris, Ruta graveolens, Sulfolobus solfataricus, Salmonella typhimurium, Serratia marcescens, soybean, rice, cotton, wheat, tobacco, Zea mays, or any gene encoding a subunit or domain of anthranilate synthase that can comprise at least one amino acid substitution in the coding region thereof. 
Domains that are mutated or joined to form a monomeric anthranilate synthase having increased tryptophan biosynthetic activity, greater stability, reduced sensitivity to tryptophan or an analog thereof, and the like, are of particular interest.

The term “5′ UTR” refers to the untranslated region of DNA upstream, or 5′, of the coding region of a gene.

The term “3′ UTR” refers to the untranslated region of DNA downstream, or 3′, of the coding region of a gene.

The term “substantially homologous” refers to two sequences which are at least about 90% identical in sequence, as measured by the BestFit program described herein (Version 10; Genetics Computer Group, Inc., University of Wisconsin Biotechnology Center, Madison, Wis.), using default parameters.

The percent of sequence identity is preferably determined using the “Best Fit” or “Gap” program of the Sequence Analysis Software Package (Version 10; Genetics Computer Group, Inc., University of Wisconsin Biotechnology Center, Madison, Wis.). “Gap” utilizes the algorithm of Needleman and Wunsch (1970) to find the alignment of two sequences that maximizes the number of matches and minimizes the number of gaps.
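For illustration, a global alignment in the style of Needleman and Wunsch can be sketched in Python as follows. This is a minimal sketch and not the “Gap” program itself: the scoring values (match +1, mismatch -1, linear gap penalty -1) and the function name `needleman_wunsch` are assumptions chosen for simplicity, and the GCG program uses its own scoring matrices and gap penalties.

```python
from typing import Tuple


def needleman_wunsch(a: str, b: str, match: int = 1, mismatch: int = -1,
                     gap: int = -1) -> Tuple[str, str, int]:
    """Globally align a and b; return (aligned_a, aligned_b, score)."""
    n, m = len(a), len(b)
    # score[i][j] holds the best score for aligning a[:i] with b[:j].
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            diag = score[i - 1][j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            up = score[i - 1][j] + gap      # gap in b
            left = score[i][j - 1] + gap    # gap in a
            score[i][j] = max(diag, up, left)

    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a, out_b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + (
                match if a[i - 1] == b[j - 1] else mismatch):
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i -= 1
            j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b)), score[n][m]


if __name__ == "__main__":
    aligned_a, aligned_b, total = needleman_wunsch("GATTACA", "GATACA")
    print(aligned_a)
    print(aligned_b)
    print(total)
```

Several alignments can share the same optimal score, so the traceback above returns only one of them; the percent identity can then be computed over the aligned columns.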

“BestFit” performs an optimal alignment of the best segment of similarity between two sequences and inserts gaps to maximize the number of matches using the local homology algorithm of Smith and Waterman (Smith and Waterman, 1981; Smith et al., 1983). The percent identity is most preferably determined using the Best Fit program using default parameters.

As used herein, the term “operatively linked” means that a promoter is connected to a coding region in such a way that the transcription of that coding region is controlled and regulated by that promoter. Means for operatively linking a promoter to a coding region are well known in the art.

The term “substantially identical” as used herein, refers to a comparison of polynucleotide molecules that encode the same or nearly the same protein or polypeptide. The four letter genetic code (A, G, C, and T/U) comprises three letter codons that direct t-RNA molecules to assemble amino acids into a polypeptide from an mRNA template. Having more than one codon that may code for the same amino acid is referred to as degenerate. Degenerate codons are used to construct substantially identical polynucleotide molecules that encode the same polypeptide where these polynucleotide molecules have a sequence of nucleotides when compared along their entire length in which they are at least 85% identical to one another, more preferably 86 to 90% identical to one another, even more preferably 91 to 95% identical to one another, or most preferably 96 to 99% identical to one another. Such sequences may differ due to changes at one or more bases, including a coding region with a truncation or deletion, yet still encode a polypeptide with anthranilate synthase activity.

An “artificial polynucleotide” as used in the present invention is a DNA sequence designed according to the methods of the present invention and created as an isolated DNA molecule for use in a DNA construct that provides expression of a protein in host cells, and for the purposes of cloning into appropriate constructs or other uses known to those skilled in the art. Computer programs are available for these purposes, including but not limited to the “BestFit” or “Gap” programs of the Sequence Analysis Software Package, Genetics Computer Group (GCG), Inc., University of Wisconsin Biotechnology Center, Madison, Wis. 53711. The artificial polynucleotide may be created by one or more methods known in the art that include, but are not limited to, overlapping PCR and chemical synthesis. An artificial polynucleotide molecule of the present invention is substantially identical to other polynucleotides that code for the identical or nearly identical protein.

“Codon optimization” as used in the present invention is a method of adjusting the frequency of codon usage of a polynucleotide encoding a protein derived from a foreign source to provide a pattern of codon usage that will result in enhanced expression of the protein in the host cell (see, for example, Fischhoff and Perlak, U.S. Pat. No. 5,500,365). In particular, codons to be used to optimize the artificial polynucleotides of the present invention are selected from a monocot high usage table according to the following criteria: a) after a known polynucleotide sequence is scanned using a sequence optimizer computer program, codons 1 through 50 are selected from the most abundant codons from a monocot high usage table, avoiding the use of rare codons; b) codons 51 to the end of the molecule are selected randomly from a monocot high usage table; c) codon repeats are avoided; d) regions that are A+T-rich or that may contain cryptic mRNA processing sites are identified and modified to increase the GC content and to avoid truncation of the processed molecule; and e) the resulting polynucleotide sequence is manually edited to remove unwanted restriction sites, insert stops, cryptic polyadenylation signals and other undesirable sequences. Computer programs such as the “BestFit” or “Gap” programs of the Sequence Analysis Software Package, Genetics Computer Group (GCG), Inc., University of Wisconsin Biotechnology Center, Madison, Wis. 53711, as well as other such programs designed for this purpose, may be used to adjust the frequency of codon utilization to provide enhanced expression of the protein in the host cell. A DNA sequence that has been subjected to such adjustment of codon usage is said to be “codon-optimized”.
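Criteria a) through c) above can be sketched in Python as follows. This is an illustrative sketch only and not the sequence optimizer program referred to in this specification. The table `HIGH_USAGE_EXAMPLE` is a hypothetical placeholder covering a few amino acids: the codons listed do encode the indicated amino acids, but their ordering is assumed for illustration and is not measured monocot usage data. “Codon repeats” is interpreted here as the same codon appearing twice in succession, which is also an assumption. Criteria d) and e) are not implemented.

```python
import random
from typing import Dict, List

# Hypothetical placeholder table: codons per amino acid, most preferred first.
# The ordering is illustrative only and is NOT real monocot usage data.
HIGH_USAGE_EXAMPLE: Dict[str, List[str]] = {
    "M": ["ATG"],
    "A": ["GCC", "GCG", "GCT"],
    "S": ["AGC", "TCC", "TCG"],
    "L": ["CTG", "CTC"],
    "K": ["AAG"],
    "W": ["TGG"],
}


def choose_codons(protein: str, table: Dict[str, List[str]] = HIGH_USAGE_EXAMPLE,
                  head: int = 50, seed: int = 0) -> str:
    """Back-translate a protein following criteria (a) to (c) as interpreted here."""
    rng = random.Random(seed)  # seeded so that the sketch is reproducible
    codons: List[str] = []
    for idx, aa in enumerate(protein):
        choices = table[aa]  # raises KeyError for residues missing from the table
        in_head = idx < head
        # (a) first `head` codons: most preferred codon; (b) remainder: random pick.
        pick = choices[0] if in_head else rng.choice(choices)
        # (c) avoid repeating the previous codon when an alternative exists.
        if codons and pick == codons[-1] and len(choices) > 1:
            alternatives = [c for c in choices if c != codons[-1]]
            pick = alternatives[0] if in_head else rng.choice(alternatives)
        codons.append(pick)
    return "".join(codons)


if __name__ == "__main__":
    print(choose_codons("MAAS"))                   # deterministic within the first 50 codons
    print(choose_codons("AAAALLLL", head=0))       # random selection from the table
```

An amino acid with a single codon, such as methionine or tryptophan, necessarily repeats its codon when it occurs twice in succession, so criterion c) can only be applied where the table offers an alternative.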

The term “translation” refers to the production of the corresponding gene product, i.e., a peptide, polypeptide, or protein, from an mRNA.

As used herein, the term “nutritionally enhanced” refers to an elevated, increased or high level of a particular amino acid in a plant cell as compared to the level of the same amino acid found in an untransformed plant cell, plant tissue, plant part or plant, i.e., one where the genome has not been altered by the presence of an exogenous nucleic acid. For example, the levels of free L-tryptophan in a transformed plant seed are compared with those in an untransformed plant seed (“the starting material”).

As used herein, an “isogenic” line or event or cell is identical to the transgenic line or event or cell with respect to its genetic composition except that the transgenic line or event or cell contains additional DNA that has been introduced into its genome that is not present in the isogenic line or event or cell.

General Concepts

The present invention relates to novel nucleic acids and methods for obtaining plants that produce elevated levels of free L-tryptophan. The overproduction results from the introduction and expression of a nucleic acid encoding anthranilate synthase, or a domain thereof. Such anthranilate synthase nucleic acids include wild type or mutant α-domains, or monomeric forms of anthranilate synthase. A monomeric form of anthranilate synthase comprises at least two anthranilate synthase domains in a single polypeptide chain, e.g., an α-domain linked to a β-domain.

Native plant anthranilate synthases are generally quite sensitive to feedback inhibition by L-tryptophan and analogs thereof. Such inhibition constitutes a key mechanism for regulating the tryptophan synthetic pathway. Therefore, an anthranilate synthase or a domain thereof that is highly active, more efficient or that is inhibited to a lesser extent by tryptophan or an analog thereof will likely produce elevated levels of tryptophan. According to the invention, the Agrobacterium tumefaciens anthranilate synthase is particularly useful for producing high levels of tryptophan.

To generate high levels of tryptophan in a plant or a selected host cell, the selected anthranilate synthase nucleic acid is isolated and may be manipulated in vitro to include regulatory signals required for gene expression in plant cells or other cell types. Because the tryptophan biosynthetic pathway in plants is reported to be present within plastids, the exogenous anthranilate synthase nucleic acids are either introduced into plastids or are modified by adding a nucleic acid segment encoding an amino-terminal plastid transit peptide. Such a plastid transit peptide can direct the anthranilate synthase gene product into plastids. In some instances the anthranilate synthase may already contain a plastid transport sequence, in which case there is no need to add one.

In order to alter the biosynthesis of tryptophan, the nucleic acid encoding an anthranilate synthase activity must be introduced into plant cells or other host cells and these transformed cells identified, either directly or indirectly. An entire anthranilate synthase or a useful portion or domain thereof can be used. The anthranilate synthase is stably incorporated into the plant cell genome. The transcriptional signals controlling expression of the anthranilate synthase must be recognized by and be functional within the plant cells or other host cells. That is, the anthranilate synthase must be transcribed into messenger RNA (mRNA), and the mRNA must be stable in the plant cell nucleus and be transported intact to the cytoplasm for translation. The anthranilate synthase mRNA must have appropriate translational signals to be recognized and properly translated by plant cell ribosomes. The polypeptide gene product must substantially escape proteolytic attack in the cytoplasm, be transported into the correct cellular compartment (e.g. a plastid) and be able to assume a three-dimensional conformation that will confer enzymatic activity. The anthranilate synthase must further be able to function in the biosynthesis of tryptophan and its derivatives; that is, it must be localized near the native plant enzymes catalyzing the flanking steps in biosynthesis (presumably in a plastid) in order to obtain the required substrates and to pass on the appropriate product.

Even if all these conditions are met, successful overproduction of tryptophan is not a predictable event. The expression of some transgenes may be negatively affected by nearby chromosomal elements. If the high level of tryptophan is achieved by mutation to reduce feedback inhibition, there may be other control mechanisms compensating for the reduced regulation at the anthranilate synthase step. There may be mechanisms that increase the rate of breakdown of the accumulated amino acids. Tryptophan and related amino acids must also be overproduced at levels that are not toxic to the plant. Finally, the introduced trait must be stable and heritable in order to permit commercial development and use.

Isolation and Identification of DNA Coding for an Anthranilate Synthase

Nucleic acids encoding an anthranilate synthase can be identified and isolated by standard methods, for example, as described by Sambrook et al., (1989); Sambrook and Russell, (2001). For example, a DNA sequence encoding an anthranilate synthase or a domain thereof can be identified by screening of a DNA or cDNA library generated from nucleic acid derived from a particular cell type, cell line, primary cells, or tissue. Examples of libraries useful for identifying and isolating an anthranilate synthase include, but are not limited to, a cDNA library derived from Agrobacterium tumefaciens strain A348, maize inbred line B73 (Stratagene, La Jolla, Calif., Cat. #937005, Clontech, Palo Alto, Calif., Cat. # FL1032a, #FL1032b, and FL1032n), genomic library from maize inbred line Mo17 (Stratagene, Cat. #946102), genomic library from maize inbred line B73 (Clontech, Cat. # FL1032d), genomic DNA from Anabaena M22983 (e.g., Genbank Accession No. GI 152445), Arabidopsis thaliana, Azospirillum brasilense (e.g., Genbank Accession No. GI 1174156), Brucella melitensis (GI 17982357), Escherichia coli, Euglena gracilis, Mesorhizobium loti (e.g., Genbank Accession No. GI 13472468), Nostoc sp. PCC7120 (e.g., Genbank Accession No. GI 17227910 or GI 17230725), Rhizobium meliloti (e.g., Genbank Accession No. GI 95177), Ruta graveolens, Rhodopseudomonas palustris, Salmonella typhimurium, Serratia marcescens, Sulfolobus solfataricus, soybean, rice, cotton, wheat, tobacco, Zea mays (maize), or other species. Moreover, anthranilate synthase nucleic acids can be isolated by nucleic acid amplification procedures using genomic DNA, mRNA or cDNA isolated from any of these species.

Screening for DNA fragments that encode all or a portion of the sequence encoding an anthranilate synthase can be accomplished by screening plaques from a genomic or cDNA library for hybridization to a probe of an anthranilate synthase gene from other organisms or by screening plaques from a cDNA expression library for binding to antibodies that specifically recognize anthranilate synthase. DNA fragments that hybridize to anthranilate synthase probes from other organisms and/or plaques carrying DNA fragments that are immunoreactive with antibodies to anthranilate synthase can be subcloned into a vector and sequenced and/or used as probes to identify other cDNA or genomic sequences encoding all or a portion of the desired anthranilate synthase gene. Preferred cDNA probes for screening a maize or plant library can be obtained from plasmid clones pDPG600 or pDPG602.

A cDNA library can be prepared, for example, by random oligo priming or oligo dT priming. Plaques containing DNA fragments can be screened with probes or antibodies specific for anthranilate synthase. DNA fragments encoding a portion of an anthranilate synthase gene can be subcloned and sequenced and used as probes to identify a genomic anthranilate synthase gene. DNA fragments encoding a portion of a bacterial or plant anthranilate synthase can be verified by determining sequence homology with other known anthranilate synthase genes or by hybridization to anthranilate synthase-specific messenger RNA. Once cDNA fragments encoding portions of the 5′, middle and 3′ ends of an anthranilate synthase are obtained, they can be used as probes to identify and clone a complete genomic copy of the anthranilate synthase gene from a genomic library.

Portions of the genomic copy or copies of an anthranilate synthase gene can be sequenced and the 5′ end of the gene identified by standard methods, including DNA sequence homology to other anthranilate synthase genes or RNase protection analysis, for example, as described by Sambrook et al., (1989); Sambrook and Russell, (2001). The 3′ and 5′ ends of the target gene can also be located by computer searches of genomic sequence databases using known AS coding regions. Once portions of the 5′ end of the gene are identified, complete copies of the anthranilate synthase gene can be obtained by standard methods, including cloning or polymerase chain reaction (PCR) synthesis using oligonucleotide primers complementary to the DNA sequence at the 5′ end of the gene. The presence of an isolated full-length copy of the anthranilate synthase gene can be verified by hybridization, partial sequence analysis, or by expression of a maize anthranilate synthase enzyme.

Exemplary isolated DNAs of the invention include DNAs having the following nucleotide SEQ ID NO:

SEQ ID NO: 1 Agrobacterium tumefaciens (wild type)

SEQ ID NO: 2 Zea mays (wild type, alpha2)

SEQ ID NO: 3 Ruta graveolens

SEQ ID NO: 46 truncated trpE gene of E. coli EMG2 (K-12 wt F+)

SEQ ID NO: 67 Zea mays (C28 mutant)

SEQ ID NO: 68 Zea mays (C28+terminator)

SEQ ID NO: 71 Chloroplast Targeting Peptide (g)

SEQ ID NO: 73 Chloroplast Targeting Peptide (a)

SEQ ID NO: 75 Agrobacterium tumefaciens (optimized)

SEQ ID NO: 76 Rhodopseudomonas palustris

SEQ ID NO: 83 Rhodopseudomonas palustris (RhoPa_TrpEG)

SEQ ID NO: 84 Agrobacterium tumefaciens V48F mutant

SEQ ID NO: 85 Agrobacterium tumefaciens V48Y mutant

SEQ ID NO: 86 Agrobacterium tumefaciens S51F mutant

SEQ ID NO: 87 Agrobacterium tumefaciens S51C mutant

SEQ ID NO: 88 Agrobacterium tumefaciens N52F mutant

SEQ ID NO: 89 Agrobacterium tumefaciens P293A mutant

SEQ ID NO: 90 Agrobacterium tumefaciens P293G mutant

SEQ ID NO: 91 Agrobacterium tumefaciens F298W mutant

SEQ ID NO: 92 Agrobacterium tumefaciens S50K mutant

SEQ ID NO: 93 Agrobacterium tumefaciens F298A mutant

SEQ ID NO: 94 rice

SEQ ID NO: 95 rice isozyme

SEQ ID NO: 96 maize (U.S. Pat. No. 6,118,047 to Anderson)

SEQ ID NO: 97 wheat

SEQ ID NO: 98 tobacco

SEQ ID NO: 104 Gossypium hirsutum (alpha)

SEQ ID NO: 105 Gossypium hirsutum (beta)

SEQ ID NO: 106 Glycine max (alpha)

SEQ ID NO: 107 Glycine max (beta)

SEQ ID NO: 112 Glycine max (alpha) with 5′ and 3′UTRs

SEQ ID NO: 113 Glycine max (beta) with 5′ and 3′UTRs

SEQ ID NO: 116 Zea mays (beta)

SEQ ID NO: 119 Oryza sativa (beta1)

SEQ ID NO: 120 Oryza sativa (beta2)

SEQ ID NO: 121 Mesorhizobium loti

SEQ ID NO: 122 Azospirillum brasilense

SEQ ID NO: 123 Brucella melitensis

SEQ ID NO: 124 Nostoc sp.

SEQ ID NO: 125 Nostoc sp.

SEQ ID NO: 126 Rhodopseudomonas palustris

SEQ ID NO: 127 Bradyrhizobium japonicum

SEQ ID NO: 128 Rhodospirillum rubrum

SEQ ID NO: 129 Thermobifida fusca

SEQ ID NO: 134 Sorghum bicolor (beta1)

SEQ ID NO: 135 Sorghum bicolor (beta2)

SEQ ID NO: 136 Zea mays (alpha1)

SEQ ID NO: 144 Agrobacterium tumefaciens codon-optimized S51F mutant

SEQ ID NO: 145 Agrobacterium tumefaciens codon-optimized S51C mutant

SEQ ID NO: 146 ASA2(+18); Zea mays sequence encoding a chloroplast targeting peptide (CTP) fused to 18 amino acids from the N-terminus of the Zea mays anthranilate synthase a2 protein (ZmASA2).

SEQ ID NO: 148 Oryza sativa glutelin 1 (Os-gt1) 3′-UTR

SEQ ID NO: 149 DNA sequence of transformation vector comprising transcription units of pMON66879

Certain primers are also useful for the practice of the present invention, for example, primers having SEQ ID NOs: 9-42, 47-56, or 138-143.

The present invention also contemplates any isolated nucleic acid encoding an anthranilate synthase having, for example, any one of the following amino acid sequences.

SEQ ID NO: 4 Agrobacterium tumefaciens (wild type)

SEQ ID NO: 5 Zea mays (wild type)

SEQ ID NO: 6 Ruta graveolens

SEQ ID NO: 7 Rhizobium meliloti

SEQ ID NO: 8 Sulfolobus solfataricus

SEQ ID NO: 43 Rhizobium meliloti

SEQ ID NO: 44 Sulfolobus solfataricus

SEQ ID NO: 45 Arabidopsis thaliana

SEQ ID NO: 57 Rhodopseudomonas palustris

SEQ ID NO: 58 Agrobacterium tumefaciens V48F mutant

SEQ ID NO: 59 Agrobacterium tumefaciens V48Y mutant

SEQ ID NO: 60 Agrobacterium tumefaciens S51F mutant

SEQ ID NO: 61 Agrobacterium tumefaciens S51C mutant

SEQ ID NO: 62 Agrobacterium tumefaciens N52F mutant

SEQ ID NO: 63 Agrobacterium tumefaciens P293A mutant

SEQ ID NO: 64 Agrobacterium tumefaciens P293G mutant

SEQ ID NO: 65 Agrobacterium tumefaciens F298W mutant

SEQ ID NO: 66 Zea mays C28 mutant

SEQ ID NO: 69 Agrobacterium tumefaciens S50K mutant

SEQ ID NO: 70 Agrobacterium tumefaciens F298A mutant

SEQ ID NO: 74 Chloroplast Targeting Peptide (a)

SEQ ID NO: 72 Chloroplast Targeting Peptide (g)

SEQ ID NO: 77 Mesorhizobium loti (MesLo_13472468)

SEQ ID NO: 78 Azospirillum brasilense (AzoBr_1717765)

SEQ ID NO: 79 Brucella melitensis (BruMe_17986732)

SEQ ID NO: 80 Nostoc sp. (Nostoc_17227910)

SEQ ID NO: 81 Nostoc sp. (Nostoc_17230725)

SEQ ID NO: 82 Rhodopseudomonas palustris RhoPa_TrpEG

SEQ ID NO: 99 rice

SEQ ID NO: 100 rice isozyme

SEQ ID NO: 101 maize (U.S. Pat. No. 6,118,047 to Anderson)

SEQ ID NO: 102 wheat

SEQ ID NO: 103 tobacco

SEQ ID NO: 108 Gossypium hirsutum (alpha)

SEQ ID NO: 109 Gossypium hirsutum (beta)

SEQ ID NO: 110 Glycine max (alpha)

SEQ ID NO: 111 Glycine max (beta)

SEQ ID NO: 114 Zea mays (ASalpha2) chloroplast targeting peptide

SEQ ID NO: 115 Zea mays (ASalpha1) chloroplast targeting peptide

SEQ ID NO: 117 Oryza sativa (beta)

SEQ ID NO: 118 Zea mays (beta)

SEQ ID NO: 130 Bradyrhizobium japonicum

SEQ ID NO: 131 Rhodospirillum rubrum

SEQ ID NO: 132 Thermobifida fusca

SEQ ID NO: 133 Sorghum bicolor (beta)

SEQ ID NO: 137 Zea mays (ASalpha1)

SEQ ID NO: 147 ASA2(+18); Zea mays chloroplast targeting peptide (CTP) fused to 18 amino acids from the N-terminus of the Zea mays anthranilate synthase a2 protein

Any of these nucleic acids and polypeptides can be utilized in the practice of the invention, as well as any mutant, variant or derivative thereof.

Monomeric Anthranilate Synthases

According to the present invention, monomeric anthranilate synthases from plant and non-plant species are functional in plants and can provide high levels of tryptophan. Surprisingly, monomeric anthranilate synthases from non-plant species function very well in plants even though the sequences of these monomeric anthranilate synthases have low homology with most plant anthranilate synthases. For example, monomeric anthranilate synthases from species as diverse as bacteria, protists, and microbes can be used successfully. In particular, monomeric anthranilate synthases from bacterial species such as Agrobacterium tumefaciens, Rhizobium meliloti, Mesorhizobium loti, Brucella melitensis, Nostoc sp. PCC7120, Azospirillum brasilense, Anabaena M22983, Bradyrhizobium japonicum, Rhodospirillum rubrum, and Thermobifida fusca are functional in plants and can provide high levels of tryptophan, despite the rather low sequence identity of these monomeric anthranilate synthases with most plant anthranilate synthases.

Transgenic plants containing, for example, the wild type monomeric Agrobacterium tumefaciens anthranilate synthase can produce up to about 10,000 to about 12,000 ppm tryptophan in seeds, with average trp levels ranging up to about 7,000 to about 8,000 ppm. Non-transgenic soybean plants normally have up to only about 100 to about 200 ppm tryptophan in seeds. By comparison, transgenic plants containing an added mutant Zea mays α-domain produce somewhat lower levels of tryptophan (e.g., averages up to about 3,000 to about 4,000 ppm).

Monomeric enzymes may have certain advantages over multimeric enzymes. For example, while the present invention is not to be limited to a specific mechanism, a monomeric enzyme may provide greater stability, coordinated expression, and the like. When domains or subunits of a heterotetrameric anthranilate synthase are synthesized in vivo, those domains/subunits must properly assemble into a heterotetrameric form before the enzyme becomes active. Addition of a single domain of anthranilate synthase by transgenic means to a plant may not provide overproduction of the entire heterotetrameric enzyme because there may not be sufficient endogenous amounts of the non-transgenic domains to substantially increase levels of the functional tetramer. Hence, nucleic acids, vectors and enzymes encoding a monomeric anthranilate synthase can advantageously be used to overproduce all of the enzymatic functions of anthranilate synthase.

According to the present invention, anthranilate synthase domains from species that naturally produce heterotetrameric anthranilate synthases can be fused or linked to provide monomeric anthranilate synthases that can generate high tryptophan levels when expressed within a plant cell, plant tissue or seed. For example, a monomeric anthranilate synthase can be made by fusing or linking the α- and β-domains of anthranilate synthase so that the sequence of the α-β fusion generally aligns with an anthranilate synthase that is naturally monomeric. Examples of sequence alignments of monomeric and heterotetrameric anthranilate synthases are shown in FIGS. 7 and 15. Using such sequence alignments, the spacing and orientation of anthranilate synthase domains can be adjusted or modified to generate a monomeric anthranilate construct from heterotetrameric domains that optimally aligns with naturally monomeric anthranilate synthases. Such a fusion protein can be used to increase tryptophan levels in the tissues of a plant.

Heterotetrameric anthranilate synthases, such as the Sulfolobus solfataricus anthranilate synthase (e.g., Genbank Accession No. GI 1004323), share from about 30% to about 87% sequence homology with heterotetrameric anthranilate synthases from other plant and microbial species. Monomeric anthranilate synthases, such as the A. tumefaciens anthranilate synthase, have between about 84% and about 52% identity to the other monomeric enzymes (e.g., Rhizobium meliloti (Genbank Accession No. GI 15966140) and Azospirillum brasilense (Genbank Accession No. 1717765), respectively; see Bae et al., (1989); De Troch et al., (1997)).

However, the overall sequence identity shared between naturally monomeric and naturally heterotetrameric anthranilate synthases can be less than 30%. Hence, visual alignment, rather than computer-generated alignment, may be needed to optimally align monomeric and heterotetrameric anthranilate synthases. Landmark structures and sequences within the anthranilate synthases can facilitate sequence alignments. For example, the motif “LLES” is part of a β-sheet of the β-sandwich that forms the tryptophan-binding pocket of anthranilate synthases. Such landmark sequences can be used to more confidently align divergent anthranilate synthase sequences, and are especially useful for determination of key residues involved in tryptophan binding.
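The use of a landmark such as the “LLES” motif to register divergent sequences can be sketched as follows. This is only an illustration of the idea, not the alignment procedure used to prepare the figures: the function names are assumptions, and the sketch simply left-pads each sequence so that the first occurrence of the motif falls in the same column.

```python
def motif_offsets(sequences, motif="LLES"):
    """Map each sequence name to the 0-based index of the first occurrence
    of the motif, or to None when the motif is absent."""
    return {
        name: (seq.find(motif) if motif in seq else None)
        for name, seq in sequences.items()
    }

def anchor_on_motif(sequences, motif="LLES"):
    """Left-pad sequences with '-' so the landmark motif lines up.

    Sequences that lack the motif are omitted from the result.
    """
    offsets = {
        name: off
        for name, off in motif_offsets(sequences, motif).items()
        if off is not None
    }
    if not offsets:
        return {}
    column = max(offsets.values())
    return {
        name: "-" * (column - off) + sequences[name]
        for name, off in offsets.items()
    }
```

Residues on either side of the anchored motif would still need to be aligned by eye or by a conventional alignment program.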

To accomplish the fusion or linkage of anthranilate synthase domains, the C-terminus of the selected TrpE or α-domain is linked to the N-terminus of the TrpG domain or β-domain. In some cases, a linker peptide may be utilized between the domains to provide the appropriate spacing and/or flexibility. Appropriate linker sequences can be identified by sequence alignment of monomeric and heterotetrameric anthranilate synthases.

The selected β-domains can be cloned, for example, by hybridization, PCR amplification or as described in Anderson et al., U.S. Pat. No. 6,118,047. A plastid transit peptide sequence can also be linked to the anthranilate synthase coding region using standard methods. For example, an Arabidopsis small subunit (SSU) chloroplast targeting peptide (CTP, SEQ ID NOs: 71-74) may be used for this purpose. See also Stark et al., (1992). The fused gene can then be inserted into a suitable vector for plant transformation as described herein.

Anthranilate Synthase Mutants

Mutant anthranilate synthases contemplated by the present invention can have any type of mutation including, for example, amino acid substitutions, deletions, insertions, and/or rearrangements. Such mutants can be derivatives or variants of anthranilate synthase nucleic acids and polypeptides specifically identified herein. Alternatively, mutant anthranilate synthases can be obtained from any available species, including those not explicitly identified herein. The mutants, derivatives and variants can have identity with at least about 30% of the amino acid positions of any one of SEQ ID NOs: 4-8, 43-45, 57-66, 69-70, 77-82, 99-111, 117-118, 130-133 and 137, and have anthranilate synthase activity. In a preferred embodiment, polypeptide derivatives and variants have identity with at least about 50% of the amino acid positions of any one of SEQ ID NOs: 4-8, 43-45, 57-66, 69-70, 77-82, 99-111, 117-118, 130-133 and 137, and have anthranilate synthase activity. In a more preferred embodiment, polypeptide derivatives and variants have identity with at least about 60% of the amino acid positions of any one of SEQ ID NOs: 4-8, 43-45, 57-66, 69-70, 77-82, 99-111, 117-118, 130-133 and 137, and have anthranilate synthase activity. In a more preferred embodiment, polypeptide derivatives and variants have identity with at least about 70% of the amino acid positions of any one of SEQ ID NOs: 4-8, 43-45, 57-66, 69-70, 77-82, 99-111, 117-118, 130-133 and 137, and have anthranilate synthase activity. In an even more preferred embodiment, polypeptide derivatives and variants have identity with at least about 80% of the amino acid positions of any one of SEQ ID NOs: 4-8, 43-45, 57-66, 69-70, 77-82, 99-111, 117-118, 130-133 and 137, and have anthranilate synthase activity. In an even more preferred embodiment, polypeptide derivatives and variants have identity with at least about 90% of the amino acid positions of any one of SEQ ID NOs: 4-8, 43-45, 57-66, 69-70, 77-82, 99-111, 117-118, 130-133 and 137, and have anthranilate synthase activity. In an even more preferred embodiment, polypeptide derivatives and variants have identity with at least about 95% of the amino acid positions of any one of SEQ ID NOs: 4-8, 43-45, 57-66, 69-70, 77-82, 99-111, 117-118, 130-133 and 137, and have anthranilate synthase activity.
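By way of illustration, percent identity relative to a reference sequence can be computed from a pairwise alignment as sketched below. The sketch assumes the two sequences have already been aligned to equal length with “-” as the gap character, and it assumes that identity is expressed as a percentage of the amino acid positions of the reference sequence; other conventions for the denominator exist. The function names are hypothetical.

```python
# Identity thresholds recited in the text, in percent.
THRESHOLDS = (30, 50, 60, 70, 80, 90, 95)

def percent_identity(ref_aligned, query_aligned, gap="-"):
    """Percent of reference positions matched by the query in an alignment."""
    if len(ref_aligned) != len(query_aligned):
        raise ValueError("aligned sequences must have equal length")
    ref_positions = sum(1 for r in ref_aligned if r != gap)
    if ref_positions == 0:
        raise ValueError("reference contains no residues")
    matches = sum(
        1 for r, q in zip(ref_aligned, query_aligned) if r != gap and r == q
    )
    return 100.0 * matches / ref_positions

def highest_threshold_met(identity):
    """Largest recited threshold that the identity reaches, or None."""
    met = [t for t in THRESHOLDS if identity >= t]
    return max(met) if met else None
```

Identity alone does not establish that a variant falls within any embodiment; each embodiment above also requires anthranilate synthase activity, which must be determined by assay.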

In one embodiment, anthranilate synthase mutants, variants and derivatives can be identified by hybridization of any one of SEQ ID NOs: 1-3, 46, 47-56, 67-68, 75-76, 83-98, 104-107, 112, 113, 116, 119-129, 134-136, and 138-145, or a fragment or primer thereof under moderate or, preferably, high stringency conditions to a selected source of nucleic acids. Moderate and stringent hybridization conditions are well known in the art (e.g., sections 9.47-9.51 of Sambrook et al., (1989); and Sambrook and Russell, (2001)). For example, stringent conditions are those that (1) employ low ionic strength and high temperature for washing, for example, 0.015 M NaCl/0.0015 M sodium citrate (SSC); 0.1% sodium lauryl sulfate (SDS) at 50° C., or (2) employ a denaturing agent such as formamide during hybridization, e.g., 50% formamide with 0.1% bovine serum albumin/0.1% Ficoll/0.1% polyvinylpyrrolidone/50 mM sodium phosphate buffer at pH 6.5 with 750 mM NaCl, 75 mM sodium citrate at 42° C. Another example is use of 50% formamide, 5×SSC (0.75 M NaCl, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium pyrophosphate, 5×Denhardt's solution, sonicated salmon sperm DNA (50 μg/ml), 0.1% sodium dodecylsulfate (SDS), and 10% dextran sulfate at 42° C., with washes at 42° C. in 0.2×SSC and 0.1% SDS.
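The salt concentrations recited above are all multiples of a single SSC stock, which the following short calculation makes explicit. The 1×SSC composition used here (150 mM NaCl, 15 mM sodium citrate) is inferred from the 5×SSC values given in the text; the function name is an assumption.

```python
from fractions import Fraction

# 1x SSC in millimolar, inferred from "5xSSC (0.75 M NaCl, 0.075 M sodium citrate)".
SSC_1X_MM = {"NaCl": Fraction(150), "sodium citrate": Fraction(15)}

def ssc(factor):
    """Return the millimolar composition of SSC at the given strength."""
    f = Fraction(str(factor))  # exact arithmetic, e.g. "0.2" -> 1/5
    return {component: conc * f for component, conc in SSC_1X_MM.items()}

# 0.1x SSC corresponds to the 0.015 M NaCl / 0.0015 M sodium citrate wash.
low_salt_wash = ssc(0.1)
# 5x SSC corresponds to the 750 mM NaCl / 75 mM sodium citrate hybridization buffer.
hybridization = ssc(5)
# 0.2x SSC is the wash used in the final example.
final_wash = ssc(0.2)
```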

The present invention further provides hybridization probes and primers comprising a novel isolated and purified DNA segment of at least seven nucleotide bases, which can be detectably labeled or bind to a detectable label. Such a hybridization probe or primer can hybridize under moderate or high stringency conditions to either strand of a DNA molecule that encodes an anthranilate synthase. Examples of such hybridization probes and primers include any one of SEQ ID NOs: 9-42, 47-56, and 138-143.

The anthranilate synthase can be any anthranilate synthase, or a mutant or domain thereof, such as the α-domain. The anthranilate synthase can be a monomeric anthranilate synthase. Functional mutants are preferred, particularly those that can generate high levels of tryptophan in a plant, for example, those mutants that are substantially resistant to inhibition by an amino acid analog of tryptophan.

Nucleic acids encoding mutant anthranilate synthases can also be generated from any convenient species, for example, from nucleic acids encoding any domain of Agrobacterium tumefaciens, Anabaena M22983 (e.g. Genbank Accession No. GI 152445), Arabidopsis thaliana, Azospirillum brasilense (e.g., Genbank Accession No. GI 1174156), Brucella melitensis (e.g., Genbank Accession No. GI 17982357), Escherichia coli, Euglena gracilis, Mesorhizobium loti (e.g., Genbank Accession No. GI 13472468), Nostoc sp. PCC7120 (e.g., Genbank Accession No. GI 17227910 or GI 17230725), Rhizobium meliloti (e.g., Genbank Accession No. GI 95177), Ruta graveolens, Rhodopseudomonas palustris, Salmonella typhimurium, Serratia marcescens, Sulfolobus solfataricus, Bradyrhizobium japonicum, Rhodospirillum rubrum, Thermobifida fusca, Sorghum bicolor, soybean, rice, cotton, wheat, tobacco, Zea mays (maize), or any gene encoding a subunit or domain of anthranilate synthase.

Mutants having increased anthranilate synthase activity, reduced sensitivity to feedback inhibition by tryptophan or analogs thereof, and/or the ability to generate increased amounts of tryptophan in a plant are desirable. Such mutants do have a functional change in the level or type of activity they exhibit and are sometimes referred to as “derivatives” of the anthranilate synthase nucleic acids and polypeptides provided herein.

However, the present invention also contemplates anthranilate synthase variants as well as anthranilate synthase nucleic acids with “silent” mutations. As used herein, a silent mutation is a mutation that changes the nucleotide sequence of the anthranilate synthase but that does not change the amino acid sequence of the encoded anthranilate synthase. A variant anthranilate synthase is encoded by a mutant nucleic acid and the variant has one or more amino acid changes that do not substantially change its activity when compared to the corresponding wild type anthranilate synthase. The invention is directed to all such derivatives, variants and anthranilate synthase nucleic acids with silent mutations.
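Whether a nucleotide change is silent in the sense defined above can be checked by translating both coding sequences, as in the following sketch. It uses the standard genetic code and assumes in-frame coding sequences of equal reading frame; the function names are hypothetical. The sketch cannot distinguish a variant from a derivative, since that distinction depends on enzyme activity rather than sequence.

```python
from itertools import product

# Standard genetic code, built in TCAG order ('*' marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(dna):
    """Translate an in-frame DNA coding sequence into one-letter amino acids."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))

def classify_mutation(wild_type, mutant):
    """Return 'identical', 'silent', or 'amino acid change'."""
    if wild_type.upper() == mutant.upper():
        return "identical"
    if translate(wild_type) == translate(mutant):
        return "silent"
    return "amino acid change"
```

For example, changing a GTT codon to GTC leaves valine unchanged and is reported as silent, whereas changing GTT to TTT replaces valine with phenylalanine.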

DNA encoding a mutated anthranilate synthase that is resistant and/or tolerant to L-tryptophan or amino acid analogs of tryptophan can be obtained by several methods. The methods include, but are not limited to:

1. spontaneous variation and direct mutant selection in cultures;

2. direct or indirect mutagenesis procedures on tissue cultures of any cell types or tissue, seeds or plants;

3. mutation of the cloned anthranilate synthase gene by methods such as chemical mutagenesis, site specific or site directed mutagenesis (Sambrook et al., cited supra), transposon mediated mutagenesis (Berg et al., 1983), and deletion mutagenesis (Mitra et al., 1989);

4. rational design of mutations in key residues; and

5. DNA shuffling to incorporate mutations of interest into various anthranilate synthase nucleic acids.

For example, protein structural information from available anthranilate synthase proteins can be used to rationally design anthranilate synthase mutants that have a high probability of having increased activity or reduced sensitivity to tryptophan or tryptophan analogs. Such protein structural information is available, for example, on the Sulfolobus solfataricus anthranilate synthase (Knochel et al., 1999). Rational design of mutations can be accomplished by alignment of the selected anthranilate synthase amino acid sequence with the anthranilate synthase amino acid sequence from an anthranilate synthase of known structure, for example, Sulfolobus solfataricus. See FIGS. 3, 7, and 15. The predicted tryptophan binding and catalysis regions of the anthranilate synthase protein can be assigned by combining the knowledge of the structural information with the sequence homology. For example, residues in the tryptophan binding pocket can be identified as potential candidates for mutation to alter the resistance of the enzyme to feedback inhibition by tryptophan. Using such structural information, several Agrobacterium tumefaciens anthranilate synthase mutants were rationally designed in the site or domain involved in tryptophan binding.

Using such sequence and structural analysis, regions analogous to the monomeric Agrobacterium tumefaciens anthranilate synthase at approximately positions 25-60 or 200-225 or 290-300 or 370-375 were identified in the monomeric Agrobacterium tumefaciens anthranilate synthase as being potentially useful residues for mutation to produce active anthranilate synthases that may have less sensitivity to tryptophan feedback inhibition. More specifically, amino acids analogous to P29, E30, S31, I32, S42, V43, V48, S50, S51, N52, N204, P205, M209, F210, G221, N292, P293, F298, and A373 in the monomeric Agrobacterium tumefaciens anthranilate synthase are identified as being potentially useful residues for mutation to produce active anthranilate synthases that may have less sensitivity to tryptophan feedback inhibition. The present invention contemplates any amino acid substitution or insertion at any of these positions. Alternatively, the amino acid at any of these positions can be deleted.

Site directed mutagenesis can be used to generate amino acid substitutions, deletions and insertions at a variety of sites. Examples of specific mutations made within the Agrobacterium tumefaciens anthranilate synthase coding region include the following:

at about position 48 replace Val with Phe (see e.g., SEQ ID NO: 58);

at about position 48 replace Val with Tyr (see e.g., SEQ ID NO: 59);

at about position 51 replace Ser with Phe (see e.g., SEQ ID NO: 60);

at about position 51 replace Ser with Cys (see e.g., SEQ ID NO: 61);

at about position 52 replace Asn with Phe (see e.g., SEQ ID NO: 62);

at about position 293 replace Pro with Ala (see e.g., SEQ ID NO: 63);

at about position 293 replace Pro with Gly (see e.g., SEQ ID NO: 64); or

at about position 298 replace Phe with Trp (see e.g., SEQ ID NO: 65).

Similar mutations can be made in analogous positions of any anthranilate synthase by alignment of the amino acid sequence of the anthranilate synthase to be mutated with an Agrobacterium tumefaciens anthranilate synthase amino acid sequence. One example of an Agrobacterium tumefaciens anthranilate synthase amino acid sequence that can be used for alignment is SEQ ID NO: 4.
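The substitutions listed above are conventionally written in the form V48F (wild type residue, position, replacement residue). A minimal sketch of applying such a substitution to an amino acid sequence, with a check that the expected wild type residue is actually present at the stated position, is given below. The function name is an assumption, and the short sequence used in the usage note is hypothetical; it is not SEQ ID NO: 4.

```python
import re

def apply_substitution(protein, mutation):
    """Apply a substitution written like 'V48F' (1-based position).

    Raises ValueError if the notation is malformed, the position is out of
    range, or the wild type residue does not match the sequence.
    """
    match = re.fullmatch(r"([A-Z])(\d+)([A-Z])", mutation)
    if match is None:
        raise ValueError(f"unrecognized mutation notation: {mutation!r}")
    wild_type, position, replacement = (
        match.group(1), int(match.group(2)), match.group(3)
    )
    if not 1 <= position <= len(protein):
        raise ValueError(f"position {position} is outside the sequence")
    found = protein[position - 1]
    if found != wild_type:
        raise ValueError(
            f"expected {wild_type} at position {position}, found {found}"
        )
    return protein[:position - 1] + replacement + protein[position:]
```

With the hypothetical sequence "MVSNP", apply_substitution("MVSNP", "V2F") returns "MFSNP". When the same substitution is transferred to a different anthranilate synthase, the position number would first be mapped through the alignment to SEQ ID NO: 4, since analogous residues generally do not share the same numbering.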

Useful mutants can also be identified by classical mutagenesis and genetic selection. A functional change can be detected in the activity of the enzyme encoded by the gene by exposing the enzyme to free L-tryptophan or amino acid analogs of tryptophan, or by detecting a change in the DNA molecule using restriction enzyme mapping or DNA sequence analysis.

For example, a gene encoding an anthranilate synthase substantially tolerant to 5-methyltryptophan (5-MT) can be isolated from a 5-methyltryptophan tolerant cell line. See U.S. Pat. No. 4,581,847, the disclosure of which is incorporated by reference herein. Briefly, partially differentiated plant cell cultures are grown and subcultured with continuous exposures to low levels of 5-methyltryptophan. 5-methyltryptophan concentrations are then gradually increased over several subculture intervals. Cells or tissues growing in the presence of normally toxic 5-methyltryptophan levels are repeatedly subcultured in the presence of 5-methyltryptophan and characterized. Stability of the 5-methyltryptophan tolerance trait of the cultured cells may be evaluated by growing the selected cell lines in the absence of 5-methyltryptophan for various periods of time and then analyzing growth after exposing the tissue to 5-methyltryptophan. Cell lines that are tolerant by virtue of having an altered anthranilate synthase enzyme can be selected by identifying cell lines having enzyme activity in the presence of normally toxic, i.e., growth inhibitory, levels of 5-methyltryptophan.

The anthranilate synthase gene cloned from a 5-MT- or 6-methylanthranilate (6-MA)-resistant cell line can be assessed for tolerance to 5-MT, 6-MA, or other amino acid analogs of tryptophan by standard methods, as described in U.S. Pat. No. 4,581,847, the disclosure of which is incorporated by reference herein.

Cell lines with an anthranilate synthase of reduced sensitivity to 5-methyltryptophan inhibition can be used to isolate a 5-methyltryptophan-resistant anthranilate synthase. A DNA library from a cell line tolerant to 5-methyltryptophan can be generated and DNA fragments encoding all or a portion of an anthranilate synthase gene can be identified by hybridization to a cDNA probe encoding a portion of an anthranilate synthase gene. A complete copy of the altered gene can be obtained either by cloning and ligation or by PCR synthesis using appropriate primers. The isolation of the altered gene coding for anthranilate synthase can be confirmed in transformed plant cells by determining whether the anthranilate synthase being expressed retains enzyme activity when exposed to normally toxic levels of 5-methyltryptophan (Anderson et al., U.S. Pat. No. 6,118,047).

Codon Optimization

In general, expression of bacterial genes in plants often results in inefficient protein accumulation. This can be attributed to suboptimal usage of amino acid codons in the host plant, which limits the availability of some of the tRNAs corresponding to rare codons of the bacterial gene, to the presence of cryptic mRNA processing sites, or to inefficient initiation of translation. The major block to efficient bacterial gene expression in plants appears to be at the translational level, as evidenced by low level expression of wild-type cry1A(b) and cry1A(c) genes in tobacco, tomato and cotton (Perlak et al., 1991). One efficient method to optimize or to correct low levels of protein expression is to modify the transgene DNA sequence in accordance with the favorable codon usage preference of the host organism. In addition, modification of A+T-rich regions that resemble plant introns (Goodall and Filipowicz, 1989) or of potential polyadenylation signal sequences and mRNA splicing sites (Dean et al., 1986) may enhance the stability of the mRNA in the host organism.
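As a rough illustration of how a coding sequence might be screened for the features mentioned above, the following sketch flags A+T-rich windows and occurrences of a candidate polyadenylation motif. The motif list (`AATAAA`), the window size, and the threshold are assumptions chosen for the example and are not taken from this specification; the function names are hypothetical.

```python
def find_motifs(seq, motifs=("AATAAA",)):
    """Return sorted (0-based start, motif) pairs for every motif occurrence.

    The default motif list is an illustrative assumption.
    """
    seq = seq.upper()
    hits = []
    for motif in motifs:
        start = seq.find(motif)
        while start != -1:
            hits.append((start, motif))
            start = seq.find(motif, start + 1)  # allow overlapping hits
    return sorted(hits)


def at_rich_windows(seq, window=6, threshold=0.8):
    """Return 0-based starts of windows whose A+T fraction meets the threshold.

    Window size and threshold are placeholders, not recommended values.
    """
    seq = seq.upper()
    starts = []
    for i in range(len(seq) - window + 1):
        chunk = seq[i:i + window]
        at_fraction = (chunk.count("A") + chunk.count("T")) / window
        if at_fraction >= threshold:
            starts.append(i)
    return starts


example = "GCGCAATAAAGCGC"  # made-up sequence
print(find_motifs(example))       # [(4, 'AATAAA')]
print(at_rich_windows(example))   # [3, 4, 5]
```

Flagged regions would then be candidates for synonymous codon changes that lower local A+T content without altering the encoded protein.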

Coding regions of any DNA molecule provided herein can also be optimized for expression in a selected organism, for example, a selected plant or other host cell type. An example of a DNA molecule having optimized codon usage for a selected plant is an Agrobacterium tumefaciens anthranilate synthase DNA molecule having SEQ ID NO: 75. This optimized Agrobacterium tumefaciens anthranilate synthase DNA (SEQ ID NO: 75) has 94% identity with SEQ ID NO: 1. A further example of a DNA molecule having optimized codon usage for a selected plant is an artificial polynucleotide sequence having SEQ ID NO: 144. This codon-optimized S51F mutant of Agrobacterium tumefaciens anthranilate synthase DNA (SEQ ID NO: 144) has 82.9% identity with SEQ ID NO: 86. Yet another example of a DNA molecule having optimized codon usage for a selected plant is an artificial polynucleotide sequence having SEQ ID NO: 145. This codon-optimized S51C mutant of Agrobacterium tumefaciens anthranilate synthase DNA (SEQ ID NO: 145) has 82.9% identity with SEQ ID NO: 87.
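The relationship between codon optimization and the percent identity figures recited above can be sketched as follows. The example replaces codons with synonymous "preferred" codons, confirms that the encoded protein is unchanged, and computes position-by-position identity between the original and optimized DNA. The preferred-codon table shown is a hypothetical placeholder, not a measured codon usage table for any plant, and the simple ungapped identity calculation is an assumption; the identity values recited in this specification may have been computed by a different method.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}

# Hypothetical preferred codons for illustration only.
HYPOTHETICAL_PREFERRED = {"L": "CTG", "S": "AGC", "A": "GCC"}


def translate(dna):
    """Translate a DNA coding sequence whose length is a multiple of three."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def optimize_codons(dna, preferred):
    """Swap each codon for the preferred synonymous codon, if one is listed."""
    dna = dna.upper()
    protein = translate(dna)
    optimized = "".join(preferred.get(aa, dna[3 * n:3 * n + 3])
                        for n, aa in enumerate(protein))
    if translate(optimized) != protein:
        raise ValueError("preferred table contains a non-synonymous codon")
    return optimized


def percent_identity(a, b):
    """Ungapped identity between two equal-length sequences, in percent."""
    if len(a) != len(b):
        raise ValueError("sequences must be the same length")
    same = sum(1 for x, y in zip(a.upper(), b.upper()) if x == y)
    return 100.0 * same / len(a)


original = "ATGTTATCAGCATAA"  # made-up ORF encoding M-L-S-A-stop
optimized = optimize_codons(original, HYPOTHETICAL_PREFERRED)
print(optimized)                              # ATGCTGAGCGCCTAA
print(percent_identity(original, optimized))  # 60.0
```

The point of the sketch is that an optimized sequence can differ substantially at the nucleotide level while encoding an identical polypeptide.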

Transgenes and Vectors

Once a nucleic acid encoding anthranilate synthase or a domain thereof is obtained and amplified, it is operably combined with a promoter and, optionally, with other elements to form a transgene.

Most genes have regions of DNA sequence that are known as promoters and which regulate gene expression. Promoter regions are typically found in the flanking DNA sequence upstream from the coding sequence in both prokaryotic and eukaryotic cells. A promoter sequence provides for regulation of transcription of the downstream gene sequence and typically includes from about 50 to about 2,000 nucleotide base pairs. Promoter sequences also contain regulatory sequences such as enhancer sequences that can influence the level of gene expression. Some isolated promoter sequences can provide for gene expression of heterologous genes, that is, a gene different from the native or homologous gene. Promoter sequences are also known to be strong or weak or inducible. A strong promoter provides for a high level of gene expression, whereas a weak promoter provides for a very low level of gene expression. An inducible promoter is a promoter that provides for turning on and off of gene expression in response to an exogenously added agent or to an environmental or developmental stimulus. Promoters can also provide for tissue specific or developmental regulation. An isolated promoter sequence that is a strong promoter for heterologous genes is advantageous because it provides for a sufficient level of gene expression to allow for easy detection and selection of transformed cells and provides for a high level of gene expression when desired.

The promoter in a transgene of the present invention can provide for expression of anthranilate synthase from a DNA sequence encoding anthranilate synthase. Preferably, the coding sequence is expressed so as to result in an increase in tryptophan levels within plant tissues, for example, within the seeds of the plant. In another embodiment, the coding sequence is expressed so as to result in increased tolerance of the plant cells to feedback inhibition or to growth inhibition by an amino acid analog of tryptophan or so as to result in an increase in the total tryptophan content of the cells. The promoter can also be inducible so that gene expression can be turned on or off by an exogenously added agent. For example, a bacterial promoter such as the Ptac promoter can be induced to varying levels of gene expression depending on the level of isopropylthiogalactoside (IPTG) added to the transformed bacterial cells. It may also be preferable to combine the gene with a promoter that provides tissue specific expression or developmentally regulated gene expression in plants. Many promoters useful in the practice of the invention are available to those of skill in the art.

Preferred promoters will generally include, but are not limited to, promoters that function in bacteria, bacteriophage, plastids or plant cells. Useful promoters include the CaMV 35S promoter (Odell et al., 1985), the CaMV 19S (Lawton et al., 1987), nos (Ebert et al., 1987), Adh (Walker et al., 1987), sucrose synthase (Yang et al., 1990), α-tubulin, napin, actin (Wang et al., 1992), cab (Sullivan et al., 1989), PEPCase promoter (Hudspeth et al., 1989), maize L3 oleosin promoter, P-Zm.L3 (U.S. Pat. No. 6,433,252), the 7S-alpha′-conglycinin promoter (Beachy et al., 1985) or those associated with the R gene complex (Chandler et al., 1989). Other useful promoters include the bacteriophage SP6, T3, and T7 promoters.

Plastid promoters can also be used. Most plastid genes contain a promoter for the multi-subunit plastid-encoded RNA polymerase (PEP) as well as the single-subunit nuclear-encoded RNA polymerase. A consensus sequence for the nuclear-encoded polymerase (NEP) promoters and a listing of specific promoter sequences for several native plastid genes can be found in Hajdukiewicz et al. (1997), which is hereby incorporated by reference in its entirety.

Examples of plastid promoters that can be used include the Zea mays plastid RRN (ZMRRN) promoter. The ZMRRN promoter can drive expression of a gene when the Arabidopsis thaliana plastid RNA polymerase is present. Similar promoters that can be used in the present invention are the Glycine max plastid RRN (SOYRRN) and the Nicotiana tabacum plastid RRN (NTRRN) promoters. All three promoters can be recognized by the Arabidopsis plastid RNA polymerase. The general features of RRN promoters are described by Hajdukiewicz et al. and U.S. Pat. No. 6,218,145.

Moreover, transcription enhancers or duplications of enhancers can be used to increase expression from a particular promoter. Examples of such enhancers include, but are not limited to, elements from the CaMV 35S promoter and octopine synthase genes (Last et al., U.S. Pat. No. 5,290,924). For example, it is contemplated that vectors for use in accordance with the present invention may be constructed to include the ocs enhancer element. This element was first identified as a 16 bp palindromic enhancer from the octopine synthase (ocs) gene of Agrobacterium (Ellis et al., 1987), and is present in at least 10 other promoters (Bouchez et al., 1989). It is proposed that the use of an enhancer element, such as the ocs element and particularly multiple copies of the element, will act to increase the level of transcription from adjacent promoters when applied in the context of monocot transformation. Tissue-specific promoters, including but not limited to, root-cell promoters (Conkling et al., 1990), and tissue-specific enhancers (Fromm et al., 1989) are also contemplated to be particularly useful, as are inducible promoters such as ABA- and turgor-inducible promoters, and the like.

As the DNA sequence between the transcription initiation site and the start of the coding sequence, i.e., the untranslated leader sequence, can influence gene expression, one may also wish to employ a particular leader sequence. Any leader sequence available to one of skill in the art may be employed. Preferred leader sequences direct optimum levels of expression of the attached gene, for example, by increasing or maintaining mRNA stability and/or by preventing inappropriate initiation of translation (Joshi, 1987). The choice of such sequences is at the discretion of those of skill in the art. Sequences that are derived from genes that are highly expressed in dicots, and in soybean in particular, are contemplated.

In some cases, extremely high expression of anthranilate synthase, or a domain thereof, is not necessary. For example, using the methods of the invention, such high levels of anthranilate synthase may be generated that the availability of substrate, rather than enzyme, may limit the levels of tryptophan generated. In such cases, more moderate or regulated levels of expression can be selected by one of skill in the art. The skilled artisan can readily modulate or regulate the levels of expression, for example, by use of a weaker promoter or by use of a developmentally regulated or tissue specific promoter.

Nucleic acids encoding the anthranilate synthase of interest can also include a plastid transit peptide (e.g. SEQ ID NOs: 72, 74, 114, or 147) to facilitate transport of the anthranilate synthase polypeptide into plastids, for example, into chloroplasts. A nucleic acid encoding the selected plastid transit peptide (e.g. SEQ ID NOs: 71, 73 or 146) is generally linked in-frame with the coding sequence of the anthranilate synthase. However, the plastid transit peptide can be placed at either the N-terminal or C-terminal end of the anthranilate synthase.

Constructs also include the nucleic acid of interest (e.g. DNA encoding an anthranilate synthase) along with a nucleic acid sequence that acts as a transcription termination signal and that allows for the polyadenylation of the resultant mRNA. Such transcription termination signals are placed 3′ or downstream of the coding region of interest. Preferred transcription termination signals contemplated include the transcription termination signal from the nopaline synthase gene of Agrobacterium tumefaciens (Bevan et al., 1983), the terminator from the octopine synthase gene of Agrobacterium tumefaciens, the 3′-UTR of the glutelin 1 gene of Oryza sativa (Os-gt1; SEQ ID NO: 148), and the 3′ end of genes encoding protease inhibitor I or II from potato or tomato, although other transcription termination signals known to those of skill in the art are also contemplated. Regulatory elements such as Adh intron 1 (Callis et al., 1987), sucrose synthase intron (Vasil et al., 1989) or TMV omega element (Gallie et al., The Plant Cell, 1:301 (1989)) may further be included where desired. These 3′ nontranslated regulatory sequences can be obtained as described in An (1987) or are already present in plasmids available from commercial sources such as Clontech (Palo Alto, Calif.). The 3′ nontranslated regulatory sequences can be operably linked to the 3′ terminus of an anthranilate synthase gene by standard methods. Other such regulatory elements useful in the practice of the invention are known to those of skill in the art.

A DNA construct may comprise a first expression cassette, comprised of, in operable linkage, a heterologous promoter, a DNA molecule encoding an anthranilate synthase α-domain protein and a transcriptional terminator. This DNA construct may further comprise a second expression cassette in operable linkage, comprising a heterologous promoter, a DNA molecule encoding an anthranilate synthase β-domain protein and a transcriptional terminator.
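The two-cassette arrangement described above can be modeled as a simple data structure, shown here as a minimal sketch. The class and field names are hypothetical, and the particular pairing of promoter and terminator in the example is an illustrative assumption rather than a construct disclosed in this specification.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class ExpressionCassette:
    """One cassette: promoter, optional transit peptide, coding region, terminator."""
    promoter: str
    coding_region: str
    terminator: str
    transit_peptide: Optional[str] = None

    def elements(self) -> List[str]:
        # Elements are listed 5' to 3' in operable linkage.
        parts = [self.promoter]
        if self.transit_peptide:
            parts.append(self.transit_peptide)
        parts += [self.coding_region, self.terminator]
        return parts


@dataclass
class Construct:
    """A DNA construct holding one or more expression cassettes in order."""
    cassettes: List[ExpressionCassette]

    def layout(self) -> str:
        return " | ".join(" > ".join(c.elements()) for c in self.cassettes)


# Illustrative element names only; the combination is an assumption.
construct = Construct([
    ExpressionCassette("CaMV 35S", "AS alpha-domain", "nos 3'"),
    ExpressionCassette("CaMV 35S", "AS beta-domain", "nos 3'"),
])
print(construct.layout())
# CaMV 35S > AS alpha-domain > nos 3' | CaMV 35S > AS beta-domain > nos 3'
```

Keeping each cassette as its own record makes it straightforward to swap a promoter or add a transit peptide to one domain without touching the other.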

Selectable marker genes or reporter genes are also useful in the present invention. Such genes can impart a distinct phenotype to cells expressing the marker gene and thus allow such transformed cells to be distinguished from cells that do not have the marker. Selectable marker genes confer a trait that one can “select” for by chemical means, i.e., through the use of a selective agent (e.g., a herbicide, antibiotic, or the like). Reporter genes, or screenable genes, confer a trait that one can identify through observation or testing, i.e., by “screening” (e.g., the R-locus trait). Of course, many examples of suitable marker genes are known to the art and can be employed in the practice of the present invention.

Possible selectable markers for use in connection with the present invention include, but are not limited to, a neo gene (Potrykus et al., 1985) which codes for neomycin resistance and can be selected for using kanamycin, G418, and the like; a bar gene which codes for bialaphos resistance; a gene which encodes an altered EPSP synthase protein (Hinchee et al., 1988) thus conferring glyphosate resistance; a nitrilase gene such as bxn from Klebsiella ozaenae which confers resistance to bromoxynil (Stalker et al., 1988); a mutant acetolactate synthase gene (ALS) that confers resistance to imidazolinone, sulfonylurea or other ALS-inhibiting chemicals (EP 154 204 (1985)); a methotrexate-resistant DHFR gene (Thillet et al., 1988); a dalapon dehalogenase gene that confers resistance to the herbicide dalapon; or a mutated anthranilate synthase gene that confers resistance to 5-methyltryptophan. Where a mutant EPSP synthase gene is employed, additional benefit may be realized through the incorporation of a suitable plastid transit peptide (CTP).

Illustrative selectable marker genes capable of being used in systems to select transformants are the genes that encode the enzyme phosphinothricin acetyltransferase, such as the bar gene from Streptomyces hygroscopicus or the pat gene from Streptomyces viridochromogenes (U.S. Pat. No. 5,550,318, which is incorporated by reference herein). The enzyme phosphinothricin acetyltransferase (PAT) inactivates the active ingredient in the herbicide bialaphos, phosphinothricin (PPT). PPT inhibits glutamine synthetase (Murakami et al., 1986; Twell et al., 1989), causing rapid accumulation of ammonia and cell death.

Screenable markers that may be employed include, but are not limited to, a β-glucuronidase or uidA gene (GUS) which encodes an enzyme for which various chromogenic substrates are known; an R-locus gene, which encodes a product that regulates the production of anthocyanin pigments (red color) in plant tissues (Dellaporta et al., 1988); a β-lactamase gene (Sutcliffe, 1978), which encodes an enzyme for which various chromogenic substrates are known (e.g., PADAC, a chromogenic cephalosporin); a xylE gene (Zukowsky et al., 1983) that encodes a catechol dioxygenase that can convert chromogenic catechols; an α-amylase gene (Ikuta et al., 1990); a tyrosinase gene (Katz et al., 1983) that encodes an enzyme capable of oxidizing tyrosine to DOPA and dopaquinone which in turn condenses to form the easily detectable compound melanin; a β-galactosidase gene, which encodes an enzyme for which there are chromogenic substrates; a luciferase (lux) gene (Ow et al., 1986), which allows for bioluminescence detection; or even an aequorin gene (Prasher et al., 1985), which may be employed in calcium-sensitive bioluminescence detection, or a green fluorescent protein gene (Niedz et al., 1995). The presence of the lux gene in transformed cells may be detected using, for example, X-ray film, scintillation counting, fluorescent spectrophotometry, low-light video cameras, photon-counting cameras, or multiwell luminometry. It is also envisioned that this system may be developed for populational screening for bioluminescence, such as on tissue culture plates, or even for whole plant screening.

Additionally, transgenes may be constructed and employed to provide targeting of the gene product to an intracellular compartment within plant cells or to direct a protein to the extracellular environment. This will generally be achieved by joining a DNA sequence encoding a transit or signal peptide sequence to the coding sequence of a particular gene. The resultant transit or signal peptide will transport the protein to a particular intracellular or extracellular destination, respectively, and may then be post-translationally removed. Transit peptides act by facilitating the transport of proteins through intracellular membranes, e.g., vacuole, vesicle, plastid and mitochondrial membranes, whereas signal peptides direct proteins through the extracellular membrane. By facilitating transport of the protein into compartments inside or outside the cell, these sequences may increase the accumulation of gene product.

A particular example of such a use concerns the direction of an anthranilate synthase to a particular organelle, such as the plastid, rather than to the cytoplasm. This is exemplified by the use of the Arabidopsis SSU1A transit peptide that confers plastid-specific targeting of proteins. Alternatively, the transgene can comprise a plastid transit peptide-encoding DNA sequence or a DNA sequence encoding the rbcS (RuBISCO) transit peptide operably linked between a promoter and the DNA sequence encoding an anthranilate synthase (for a review of plastid targeting peptides, see Heijne et al. (1989); Keegstra et al. (1989)). If the transgene is to be introduced into a plant cell, the transgene can also contain plant transcriptional termination and polyadenylation signals and translational signals linked to the 3′ terminus of a plant anthranilate synthase gene.

An exogenous plastid transit peptide can be used which is not encoded within a native plant anthranilate synthase gene. A plastid transit peptide is typically 40 to 70 amino acids in length and functions post-translationally to direct a protein to the plastid. The transit peptide is cleaved either during or just after import into the plastid to yield the mature protein. The complete copy of a gene encoding a plant anthranilate synthase may contain a plastid transit peptide sequence. In that case, it may not be necessary to combine an exogenously obtained plastid transit peptide sequence into the transgene.

Exogenous plastid transit peptide encoding sequences can be obtained from a variety of plant nuclear genes, so long as the products of the genes are expressed as preproteins comprising an amino terminal transit peptide and are transported into plastids. Examples of plant gene products known to include such transit peptide sequences include, but are not limited to, the small subunit of ribulose bisphosphate carboxylase, chlorophyll a/b binding protein, plastid ribosomal proteins encoded by nuclear genes, certain heat shock proteins, amino acid biosynthetic enzymes such as acetolactate synthase, 3-enolpyruvylphosphoshikimate synthase, dihydrodipicolinate synthase, anthranilate synthase and the like. In some instances a plastid transit peptide may already be encoded in the anthranilate synthase gene of interest, in which case there may be no need to add such plastid transit sequences. Alternatively, the DNA fragment coding for the transit peptide may be chemically synthesized either wholly or in part from the known sequences of transit peptides such as those listed above.

Regardless of the source of the DNA fragment coding for the transit peptide, it should include a translation initiation codon, for example, an ATG codon, and be expressed as an amino acid sequence that is recognized by and will function properly in plastids of the host plant. Attention should also be given to the amino acid sequence at the junction between the transit peptide and the anthranilate synthase enzyme where it is cleaved to yield the mature enzyme. Certain conserved amino acid sequences have been identified and may serve as a guideline. Precise fusion of the transit peptide coding sequence with the anthranilate synthase coding sequence may require manipulation of one or both DNA sequences to introduce, for example, a convenient restriction site. This may be accomplished by methods including site-directed mutagenesis, insertion of chemically synthesized oligonucleotide linkers, and the like.

Precise fusion of the nucleic acids encoding the plastid transit peptide may not be necessary so long as the coding sequence of the plastid transit peptide is in-frame with that of the anthranilate synthase. For example, additional peptidyl sequences or amino acids can often be included without adversely affecting the expression or localization of the protein of interest.

Once obtained, the plastid transit peptide sequence can be appropriately linked to the promoter and an anthranilate synthase coding region in a transgene using standard methods. A plasmid containing a promoter functional in plant cells and having multiple cloning sites downstream can be constructed or obtained from commercial sources. The plastid transit peptide sequence can be inserted downstream from the promoter using restriction enzymes. An anthranilate synthase coding region can then be translationally fused or inserted immediately downstream from and in frame with the 3′ terminus of the plastid transit peptide sequence. Hence, the plastid transit peptide is preferably linked to the amino terminus of the anthranilate synthase. Once formed, the transgene can be subcloned into other plasmids or vectors.
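The in-frame requirement for joining a transit peptide coding sequence to an anthranilate synthase coding sequence can be checked computationally before any cloning is attempted. The following is a minimal sketch under stated assumptions: the sequences are short made-up strings, the optional linker stands in for an introduced restriction site, and the function name `fuse_in_frame` is hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def fuse_in_frame(transit_dna, coding_dna, linker=""):
    """Join transit peptide DNA, an optional linker, and a coding region.

    Raises ValueError if the fusion lacks a start codon, shifts the reading
    frame of the downstream coding region, or has a premature stop codon.
    """
    transit_dna = transit_dna.upper()
    coding_dna = coding_dna.upper()
    linker = linker.upper()
    if not transit_dna.startswith("ATG"):
        raise ValueError("transit peptide sequence must begin with ATG")
    if (len(transit_dna) + len(linker)) % 3 != 0:
        raise ValueError("transit peptide plus linker shifts the reading frame")
    if len(coding_dna) % 3 != 0:
        raise ValueError("coding region length must be a multiple of 3")
    fusion = transit_dna + linker + coding_dna
    codons = [fusion[i:i + 3] for i in range(0, len(fusion), 3)]
    # Every codon except the last must be a sense codon.
    if any(codon in STOP_CODONS for codon in codons[:-1]):
        raise ValueError("premature stop codon in the fusion")
    return fusion


# Made-up sequences; GGATCC (a BamHI site) is used as an example linker.
print(fuse_in_frame("ATGGCTTCT", "ATGGTTACCTAA", linker="GGATCC"))
# ATGGCTTCTGGATCCATGGTTACCTAA
```

A six-nucleotide linker adds two codons and preserves the frame, which is consistent with the observation that a few additional amino acids at the junction can often be tolerated; a two-nucleotide linker would be rejected.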

In addition to nuclear plant transformation, the present invention also extends to direct transformation of the plastid genome of plants. Hence, targeting of the gene product to an intracellular compartment within plant cells may also be achieved by direct delivery of a gene to the intracellular compartment. Direct transformation of plastid genome may provide additional benefits over nuclear transformation. For example, direct plastid transformation of anthranilate synthase eliminates the requirement for a plastid targeting peptide and post-translational transport and processing of the pre-protein derived from the corresponding nuclear transformants. Plastid transformation of plants has been described by Maliga (2002); Heifetz (2000); Bock (2001); and Daniell et al. (2002), and references within.

After constructing a transgene containing an anthranilate synthase gene, the cassette can then be introduced into a plant cell. Depending on the type of plant cell, the level of gene expression, and the activity of the enzyme encoded by the gene, introduction of DNA encoding an anthranilate synthase into the plant cell can lead to the overproduction of tryptophan, confer tolerance to an amino acid analog of tryptophan, such as 5-methyltryptophan or 6-methylanthranilate, and/or otherwise alter the tryptophan content of the plant cell.

Transformation of Host Cells

A transgene comprising an anthranilate synthase gene can be subcloned into a known expression vector, and AS expression can be detected and/or quantitated. This method of screening is useful to identify transgenes providing for expression of an anthranilate synthase gene, and for expression of an anthranilate synthase in the plastid of a transformed plant cell.

Plasmid vectors include additional DNA sequences that provide for easy selection, amplification, and transformation of the transgene in prokaryotic and eukaryotic cells, e.g., pUC-derived vectors, pSK-derived vectors, pGEM-derived vectors, pSP-derived vectors, or pBS-derived vectors. The additional DNA sequences include origins of replication to provide for autonomous replication of the vector, selectable marker genes, preferably encoding antibiotic or herbicide resistance, unique multiple cloning sites providing for multiple sites to insert DNA sequences or genes encoded in the transgene, and sequences that enhance transformation of prokaryotic and eukaryotic cells.

Another vector that is useful for expression in both plant and prokaryotic cells is the binary Ti plasmid (as disclosed in Schilperoort et al., U.S. Pat. No. 4,940,838) as exemplified by vector pGA582. This binary Ti plasmid vector has been previously characterized by An, cited supra. This binary Ti vector can be replicated in bacteria such as E. coli and Agrobacterium. The Agrobacterium plasmid vectors can also be used to transfer the transgene to plant cells. The binary Ti vectors preferably include the nopaline T-DNA right and left borders to provide for efficient plant cell transformation, a selectable marker gene, unique multiple cloning sites in the T border regions, the colE1 origin of replication and a wide host range replicon. The binary Ti vectors carrying a transgene of the invention can be used to transform both prokaryotic and eukaryotic cells, but are preferably used to transform plant cells. See, for example, Glassman et al., U.S. Pat. No. 5,258,300.

The expression vector can then be introduced into prokaryotic or eukaryotic cells by available methods. Methods of transformation especially effective for monocots and dicots include, but are not limited to, microprojectile bombardment of immature embryos (U.S. Pat. No. 5,990,390) or Type II embryogenic callus cells as described by Gordon-Kamm et al. (1990), Fromm et al. (1990), and Walters et al. (1992), or electroporation of Type I embryogenic calluses as described by D'Halluin et al. (1992) or by Krzyzek (U.S. Pat. No. 5,384,253). Transformation of plant cells by vortexing with DNA-coated tungsten whiskers (Coffee et al., U.S. Pat. No. 5,302,523) and transformation by exposure of cells to DNA-containing liposomes can also be used.

After transformation of the selected anthranilate synthase construct into a host cell, the host cell may be used for production of useful products generated by the transgenic anthranilate synthase in combination with the host cell's enzymatic machinery. Culturing the transformed cells can lead to enhanced production of tryptophan and other useful compounds, which can be recovered from the cells or from the culture media. Examples of useful compounds that may be generated upon expression in a variety of host cells and/or organisms include tryptophan, indole acetic acid and other auxins, isoflavonoid compounds important to cardiovascular health found in soy, volatile indole compounds which act as signals to natural enemies of herbivorous insects in maize, anticarcinogens such as indole glucosinolates (indole-3-carbinol) found in the Cruciferae plant family, as well as indole alkaloids such as ergot compounds produced by certain species of fungi (Barnes et al., 1996; Frey et al., 2000; Muller et al., 2000; Mantegani et al., 1999; Zeligs, 1998; Mash et al., 1998; Melanson et al., 1997; Broadbent et al., 1998).

Accumulation of tryptophan may also lead to the increased production of secondary metabolites in microbes and plants, for example, indole-containing metabolites such as simple indoles, indole conjugates, indole alkaloids, indole phytoalexins and indole glucosinolates in plants.

Anthranilate synthases insensitive to tryptophan have the potential to increase a variety of chorismate-derived metabolites, including those derived from phenylalanine due to the stimulation of phenylalanine synthesis by tryptophan via chorismate mutase (Siehl, 1999). Other chorismate-derived metabolites that may increase when feedback insensitive anthranilate synthases are present include phenylpropanoids, flavonoids, and isoflavonoids, as well as those derived from anthranilate, such as indole, indole alkaloids, and indole glucosinolates. Many of these compounds are important plant hormones, plant defense compounds, chemopreventive agents of various health conditions, and/or pharmacologically active compounds.

The range of these compounds whose synthesis might be increased by expression of anthranilate synthase depends on the organism in which the anthranilate synthase is expressed. One of skill in the art can readily assess which organisms and host cells to use and/or test in order to generate the desired compounds. The present invention contemplates synthesis of tryptophan and other useful compounds in a variety of organisms, including plants, microbes, fungi, yeast, bacteria, insect cells, and mammalian cells.

Strategy for Selection of Tryptophan Overproducer Cell Lines

Efficient selection of a desired tryptophan analog resistant, tryptophan overproducer variant using tissue culture techniques requires careful determination of selection conditions. These conditions are optimized to allow growth and accumulation of tryptophan analog resistant, tryptophan overproducer cells in the culture while inhibiting the growth of the bulk of the cell population. The situation is complicated by the fact that the vitality of individual cells in a population can be highly dependent on the vitality of neighboring cells.

Conditions under which cell cultures are exposed to a tryptophan analog are determined by the characteristics of the interaction of the compound with the tissue. Such factors as the degree of toxicity and the rate of inhibition should be considered. The accumulation of the compounds by cells in culture, and the persistence and stability of the compounds, both in the media and in the cells, also need to be considered along with the extent of uptake and transmission to the desired cellular compartment. Additionally, it is important to determine whether the effects of the compounds can be readily reversed by the addition of tryptophan.

The effects of the analog on culture viability and morphology are carefully evaluated. It is especially important to choose analog exposure conditions that have no impact on plant regeneration capability of cultures. Choice of analog exposure conditions is also influenced by whether the analog kills cells or simply inhibits cell divisions.

The choice of a selection protocol is dependent upon the considerations described above. The protocols briefly described below can be utilized in the selection procedure. For example, to select for cells that are resistant to growth inhibition by a tryptophan analog, finely divided cells in liquid suspension culture can be exposed to high tryptophan analog levels for brief periods of time. Surviving cells are then allowed to recover and accumulate and are then reexposed for successively longer periods of time. Alternatively, organized partially differentiated cell cultures are grown and subcultured with continuous exposure to initially low levels of a tryptophan analog. Concentrations are then gradually increased over several subculture intervals. While these protocols can be utilized in a selection procedure, the present invention is not limited to these procedures.

Genes for Plant Modification

As described herein above, genes that function as selectable marker genes and reporter genes can be operably combined with the DNA sequence encoding the anthranilate synthase, or domain thereof, in transgenes, vectors and plants of the present invention. Additionally, other agronomic traits can be added to the transgenes, vectors and plants of the present invention. Such traits include, but are not limited to, insect resistance or tolerance; disease resistance or tolerance (viral, bacterial, fungal, nematode); stress resistance or tolerance, as exemplified by resistance or tolerance to drought, heat, chilling, freezing, excessive moisture, salt stress, and oxidative stress; increased yields; food content and makeup; physical appearance; male sterility; drydown; standability; prolificacy; starch properties; oil quantity and quality; and the like. One may incorporate one or more genes conferring such traits into the plants of the present invention.

Insect Resistance or Tolerance

Bacillus thuringiensis (or “Bt”) bacteria include nearly 20 known subspecies of bacteria which produce endotoxin polypeptides that are toxic when ingested by a wide variety of insect species. The biology and molecular biology of the endotoxin proteins (Bt proteins) and corresponding genes (Bt genes) has been reviewed by Whitely et al. (1986) and by Hofte et al. (1989). Genes coding for a variety of Bt proteins have been cloned and sequenced. A segment of the Bt polypeptide is essential for toxicity to a variety of Lepidoptera pests and is contained within approximately the first 50% of the Bt polypeptide molecule. Consequently, a truncated Bt polypeptide coded by a truncated Bt gene will in many cases retain its toxicity towards a number of Lepidoptera insect pests. For example, the HD73 and HD1 Bt polypeptides have been shown to be toxic to the larvae of the important Lepidoptera insect pests of plants in the U.S.A. such as the European corn borer, cutworms and earworms. The genes coding for the HD1 and HD73 Bt polypeptides have been cloned and sequenced by Geiser et al. (1986) and Adang et al. (1985), respectively, and can be cloned from HD1 and HD73 strains obtained from culture collections (e.g. Bacillus Genetic Stock Center, Columbus, Ohio or USDA Bt stock collection Peoria, Ill.) using standard protocols. Examples of Bt genes and polypeptides are described, for example, in U.S. Pat. Nos. 6,329,574; 6,303,364; 6,320,100; and 6,331,655.

DNA coding for new, previously uncharacterized Bt toxins may be cloned from the host Bacillus organism using protocols that have previously been used to clone Bt genes, and new synthetic forms of Bt toxins may also be produced.

A Bt gene useful in the present invention may include a 5′ DNA sequence including a sequence of DNA which will allow for the initiation of transcription and translation of a downstream located Bt sequence in a plant. The Bt gene may also comprise a 3′ DNA sequence that includes a sequence derived from the 3′ non-coding region of a gene that can be expressed in the plant of interest. The Bt gene would also include a DNA sequence coding for a toxic Bt polypeptide produced by Bacillus thuringiensis, or toxic portions thereof, or a polypeptide having substantial amino acid sequence homology thereto. The Bt coding sequence may include: (i) DNA sequences which code for insecticidal proteins that have substantial homology to Bt endotoxins that are active against insect pests of the plant of interest, e.g., the HD73 or HD1 Bt sequences; (ii) sequences coding for insecticidally-active segments of the Bt endotoxin polypeptide, e.g., insecticidally active HD73 or HD1 polypeptides truncated from the carboxy and/or amino termini; and/or (iii) a truncated Bt sequence fused in frame with a sequence(s) that codes for a polypeptide that provides some additional advantage such as: (a) genes that are selectable, e.g., genes that confer resistance to antibiotics or herbicides; (b) reporter genes whose products are easy to detect or assay, e.g., luciferase or beta-glucuronidase; (c) DNA sequences that code for polypeptide sequences that have some additional use in stabilizing the Bt protein against degradation or enhance the efficacy of the Bt protein against insects, e.g., protease inhibitors; and (d) sequences that help direct the Bt protein to a specific compartment inside or outside the plant cell, e.g., a signal sequence.

To obtain optimum synthesis of a Bt protein or other heterologous protein in the plant, it may also be appropriate to adjust the DNA sequence of the gene encoding such a protein to more closely resemble the genes that are efficiently expressed in the plant of interest. Since the codon usage of Bt genes may be dissimilar to that used by genes that are expressed in the plant of interest, the expression of a Bt gene in plant cells may be improved by the replacement of codons with those that are more efficiently expressed in plants, e.g., are used more frequently in the plants of interest (see Murray et al., 1989). Such replacement of codons may require the substitution of bases without changing the amino acid sequence of the resulting Bt polypeptide, so that the Bt polypeptide may be identical in sequence to that encoded by the bacterial gene or segments thereof. The complete Bt coding sequence, or sections thereof, containing a higher proportion of preferred codons than the original bacterial gene could be synthesized using standard chemical synthesis protocols, and introduced or assembled into the Bt gene using standard protocols, such as site-directed mutagenesis or DNA polymerization and ligation and the like.
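By way of illustration only, the synonymous codon replacement described above may be sketched as follows. The table of "preferred" codons and the short input sequence are hypothetical placeholders chosen for the example; they are not measured codon usage values for maize or any other plant, and the sketch is not the procedure of Murray et al. (1989).

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Hypothetical preferred codons (illustrative only, not real plant usage data).
PREFERRED = {"L": "CTG", "K": "AAG", "E": "GAG", "I": "ATC", "T": "ACC"}


def translate(dna):
    """Translate a coding sequence whose length is a multiple of three."""
    if len(dna) % 3:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def recode(dna, preferred=PREFERRED):
    """Swap each codon for the preferred synonymous codon, if one is listed."""
    if len(dna) % 3:
        raise ValueError("sequence length must be a multiple of 3")
    recoded = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        # Codons for amino acids without a listed preference are left unchanged.
        recoded.append(preferred.get(amino_acid, codon))
    return "".join(recoded)


original = "ATGTTAAAAGAAATAACTTAA"  # hypothetical fragment: M L K E I T stop
optimized = recode(original)
# The bases change but the encoded polypeptide does not.
assert translate(optimized) == translate(original)
```

In practice the preference table would be derived from the codon usage of genes that are efficiently expressed in the plant of interest, and further sequence features may also need to be considered.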

Protease inhibitors may also provide insect resistance. For example, use of a protease inhibitor II gene, pinII, from tomato or potato may be useful. Also advantageous is the use of a pinII gene in combination with a Bt toxin gene. Other genes which encode inhibitors of the insects' digestive system, or those that encode enzymes or co-factors that facilitate the production of inhibitors, may also be useful. This group includes oryzacystatin and amylase inhibitors such as those from wheat and barley.

Genes encoding lectins may confer additional or alternative insecticidal properties (Murdock et al., 1990; Czapla and Lang, 1990). Lectin genes contemplated to be useful include, for example, barley and wheat germ agglutinin (WGA) and rice lectins (Gatehouse et al., 1984).

Genes controlling the production of large or small polypeptides active against insects when introduced into the insect pests, such as lytic peptides, peptide hormones, and toxins and venoms, may also be useful. For example, the expression of juvenile hormone esterase, directed towards specific insect pests, may also result in insecticidal activity, or perhaps cause cessation of metamorphosis (Hammock et al., 1990).

Transgenic plants expressing genes encoding enzymes that affect the integrity of the insect cuticle may also be useful. Such genes include those encoding, for example, chitinase, proteases, lipases and also genes for the production of nikkomycin. Genes that code for activities that affect insect molting, such as those affecting the production of ecdysteroid UDP-glucosyl transferase, may also be useful.

Genes that code for enzymes that facilitate the production of compounds that reduce the nutritional quality of the plant to insect pests may also be useful. It may be possible, for instance, to confer insecticidal activity to a plant by altering its sterol composition. Further embodiments of the present invention concern transgenic plants with enhanced lipoxygenase activity.

The present invention also provides methods and compositions useful in altering plant secondary metabolites. One example concerns altering plants to produce DIMBOA which, it is contemplated, will confer resistance to European corn borer, rootworm and several other insect pests. See, e.g., U.S. Pat. No. 6,331,880. DIMBOA is derived from indole-related compounds. The present invention provides methods for increasing the content of indole-related compounds like tryptophan within plant cells and tissues. Hence, according to the invention the methods provided herein may also increase the levels of DIMBOA, and thereby increase the resistance of plants to insects.

The introduction of genes that can regulate the production of maysin, and genes involved in the production of dhurrin in sorghum, is also contemplated to be of use in facilitating resistance to earworm and rootworm, respectively.

Further genes encoding proteins characterized as having potential insecticidal activity may also be used. Such genes include, for example, the cowpea trypsin inhibitor (CpTI; Hilder et al., 1987) which may be used as a rootworm deterrent; genes encoding avermectin (Campbell, 1989; Ikeda et al., 1987) which may prove useful as a corn rootworm deterrent; ribosome inactivating protein genes; and genes that regulate plant structures. Transgenic plants including anti-insect antibody genes and genes that code for enzymes that can convert a non-toxic insecticide (pro-insecticide) applied to the outside of the plant into an insecticide inside the plant are also contemplated.

Environmental or Stress Resistance or Tolerance

Improvement of a plant's ability to tolerate various environmental stresses can be effected through expression of genes. For example, increased resistance to freezing temperatures may be conferred through the introduction of an “antifreeze” protein such as that of the Winter Flounder (Cutler et al., 1989) or synthetic gene derivatives thereof. Improved chilling tolerance may also be conferred through increased expression of glycerol-3-phosphate acyltransferase in plastids (Wolter et al., 1992). Resistance to oxidative stress can be conferred by expression of superoxide dismutase (Gupta et al., 1993), and can be improved by glutathione reductase (Bowler et al., 1992).

It is contemplated that the expression of genes that favorably affect plant water content, total water potential, osmotic potential, and turgor will enhance the ability of the plant to tolerate drought and will therefore be useful. It is proposed, for example, that the expression of genes encoding for the biosynthesis of osmotically-active solutes may impart protection against drought. Within this class are genes encoding for mannitol dehydrogenase (Lee and Saier, 1982) and trehalose-6-phosphate synthase (Kaasen et al., 1992).

Similarly, other metabolites may protect either enzyme function or membrane integrity (Loomis et al., 1989), and therefore expression of genes encoding for the biosynthesis of these compounds might confer drought resistance in a manner similar to or complementary to mannitol. Other examples of naturally occurring metabolites that are osmotically active and/or provide some direct protective effect during drought and/or desiccation include fructose, erythritol, sorbitol, dulcitol, glucosylglycerol, sucrose, stachyose, raffinose, proline, glycine, betaine, ononitol and pinitol. See, e.g., U.S. Pat. No. 6,281,411.

Three classes of Late Embryogenesis Abundant (LEA) proteins have been assigned based on structural similarities (see Dure et al., 1989). Expression of structural genes from all three LEA groups may confer drought tolerance. Other types of proteins induced during water stress, which may be useful, include thiol proteases, aldolases and transmembrane transporters, which may confer various protective and/or repair-type functions during drought stress. See, e.g., PCT/CA99/00219 (Na+/H+ exchanger polypeptide genes). Genes that affect lipid biosynthesis might also be useful in conferring drought resistance.

The expression of genes involved with specific morphological traits that allow for increased water extractions from drying soil may also be useful. The expression of genes that enhance reproductive fitness during times of stress may also be useful. It is also proposed that expression of genes that minimize kernel abortion during times of stress would increase the amount of grain to be harvested and hence be of value.

Enabling plants to utilize water more efficiently, through the introduction and expression of genes, may improve the overall performance even when soil water availability is not limiting. By introducing genes that improve the ability of plants to maximize water usage across a full range of stresses relating to water availability, yield stability or consistency of yield performance may be realized.

Disease Resistance or Tolerance

Resistance to viruses may be produced through expression of genes. For example, expression of antisense genes targeted at essential viral functions or expression of genes encoding viral coat proteins may impart resistance to the virus.

Resistance to diseases caused by bacteria and fungi may be conferred through introduction of genes. For example, genes encoding so-called “peptide antibiotics,” pathogenesis related (PR) proteins, toxin resistance, and proteins affecting host-pathogen interactions such as morphological characteristics may be useful.

Mycotoxin Reduction/Elimination

Production of mycotoxins, including aflatoxin and fumonisin, by fungi associated with plants is a significant factor in rendering grain not useful. Inhibition of the growth of these fungi may reduce the synthesis of these toxic substances and therefore reduce grain losses due to mycotoxin contamination. It may also be possible to introduce genes into plants that would inhibit synthesis of the mycotoxin without interfering with fungal growth. Further, expression of a novel gene which encodes an enzyme capable of rendering the mycotoxin nontoxic would be useful in order to achieve reduced mycotoxin contamination of grain.

Plant Composition or Quality

The composition of the plant may be altered, for example, to improve the balance of amino acids in a variety of ways including elevating expression of native proteins, decreasing expression of those with poor composition, changing the composition of native proteins, or introducing genes encoding entirely new proteins possessing superior composition (see, e.g., U.S. Pat. No. 6,160,208 (alteration of seed storage protein expression)). The introduction of genes that alter the oil content of the plant may be of value (see, e.g., U.S. Pat. Nos. 6,069,289 and 6,268,550 (ACCase gene)). Genes may be introduced that enhance the nutritive value of the starch component of the plant, for example by increasing the degree of branching, resulting in improved utilization of the starch in cows by delaying its metabolism.

Plant Agronomic Characteristics

Two of the factors determining where plants can be grown are the average daily temperature during the growing season and the length of time between frosts. Expression of genes that are involved in regulation of plant development may be useful, e.g., the liguleless and rough sheath genes that have been identified in corn.

Genes may be introduced into corn that would improve standability and other plant growth characteristics. Expression of genes which confer stronger stalks or improved root systems, or which prevent or reduce ear droppage, would be of value to the farmer.

Nutrient Utilization

The ability to utilize available nutrients may be a limiting factor in growth of plants. It may be possible to alter nutrient uptake, tolerance of pH extremes, mobilization through the plant, storage pools, and availability for metabolic activities by the introduction of genes. These modifications would allow a plant to more efficiently utilize available nutrients. For example, an increase in the activity of an enzyme that is normally present in the plant and involved in nutrient utilization may increase the availability of a nutrient. An example of such an enzyme would be phytase.

Male Sterility

Male sterility is useful in the production of hybrid seed, and male sterility may be produced through expression of genes. It may be possible through the introduction of TURF-13 via transformation to separate male sterility from disease sensitivity. See Levings (1990). As it may be necessary to restore male fertility for breeding purposes and for grain production, genes encoding restoration of male fertility may also be introduced.

Selection and Characterization of Resistant Cell Lines

Selections are carried out until cells or tissue are recovered which are observed to be growing well in the presence of normally inhibitory levels of a tryptophan analog. These cell “lines” are subcultured several additional times in the presence of a tryptophan analog to remove non-resistant cells and then characterized. The amount of resistance that has been obtained is determined by comparing the growth of these cell lines with the growth of unselected cells or tissue in the presence of various tryptophan analogs at various concentrations. Stability of the resistance trait of the cultured cells may be evaluated by simply growing the selected cell lines in the absence of the tryptophan analog for various periods of time and then analyzing growth after re-exposing the tissue to the analog. The resistant cell lines may also be evaluated using in vitro chemical studies to verify that the site of action of the analog is altered to a form that is less sensitive to inhibition by tryptophan analogs.

Transient expression of an anthranilate synthase gene can be detected and quantitated in the transformed cells. Gene expression can be quantitated by RT-PCR analysis, a quantitative Western blot using antibodies specific for the cloned anthranilate synthase or by detecting enzyme activity in the presence of tryptophan or an amino acid analog of tryptophan. The tissue and subcellular location of the cloned anthranilate synthase can be determined by immunochemical staining methods using antibodies specific for the cloned anthranilate synthase or subcellular fractionation and subsequent biochemical and/or immunological analyses. Sensitivity of the cloned anthranilate synthase to agents can also be assessed. Transgenes providing for expression of an anthranilate synthase or anthranilate synthase tolerant to inhibition by an amino acid analog of tryptophan or free L-tryptophan can then be used to transform monocot and/or dicot plant tissue cells and to regenerate transformed plants and seeds. Transformed cells can be selected by detecting the presence of a selectable marker gene or a reporter gene, for example, by detecting a selectable herbicide resistance marker. Transient expression of an anthranilate synthase gene can be detected in the transgenic embryogenic calli using antibodies specific for the cloned anthranilate synthase, or by RT-PCR analyses.

Plant Regeneration and Production of Seed

Transformed embryogenic calli, meristematic tissue, embryos, leaf discs and the like can then be used to generate transgenic plants that exhibit stable inheritance of the transformed anthranilate synthase gene. Plant cell lines exhibiting satisfactory levels of tolerance to an amino acid analog of tryptophan are put through a plant regeneration protocol to obtain mature plants and seeds expressing the tolerance traits by methods well known in the art (for example, see U.S. Pat. Nos. 5,990,390 and 5,489,520; and Laursen et al., (1994)). The plant regeneration protocol allows the development of somatic embryos and the subsequent growth of roots and shoots. To determine that the tolerance trait is expressed in differentiated organs of the plant, and not solely in undifferentiated cell culture, regenerated plants can be assayed for the levels of tryptophan present in various portions of the plant relative to regenerated, non-transformed plants. Transgenic plants and seeds can be generated from transformed cells and tissues showing a change in tryptophan content or in resistance to a tryptophan analog using standard methods. It is especially preferred that the tryptophan content of the leaves or seeds is increased. A change in specific activity of the enzyme in the presence of inhibitory amounts of tryptophan or an analog thereof can be detected by measuring enzyme activity in the transformed cells as described by Widholm (1972). A change in total tryptophan content can also be examined by standard methods as described by Jones et al. (1981).

Mature plants are then obtained from cell lines that are known to express the trait. If possible, the regenerated plants are self pollinated. In addition, pollen obtained from the regenerated plants is crossed to seed grown plants of agronomically important inbred lines. In some cases, pollen from plants of these inbred lines is used to pollinate regenerated plants. The trait is genetically characterized by evaluating the segregation of the trait in first and later generation progeny. The heritability and expression in plants of traits selected in tissue culture are of particular importance if the traits are to be commercially useful.
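As a non-limiting illustration of how segregation of the trait in progeny may be evaluated, the sketch below applies a chi-square goodness-of-fit test to a 3:1 ratio. The 3:1 expectation assumes a single dominant transgene locus in the selfed progeny of a hemizygous plant, which is an assumption made for the example, and the progeny counts are hypothetical.

```python
def chi_square(observed, expected_ratio):
    """Chi-square goodness-of-fit statistic for counts against a ratio."""
    total = sum(observed)
    ratio_sum = sum(expected_ratio)
    expected = [total * r / ratio_sum for r in expected_ratio]
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


# Critical value of the chi-square distribution, 1 degree of freedom, p = 0.05.
CRITICAL_0_05_DF1 = 3.841

# Hypothetical counts of tolerant and sensitive progeny (not data from this work).
observed_counts = [78, 22]

statistic = chi_square(observed_counts, [3, 1])
# A statistic below the critical value means the 3:1 hypothesis is not rejected.
fits_3_to_1 = statistic < CRITICAL_0_05_DF1
print(f"chi-square = {statistic:.2f}, consistent with 3:1: {fits_3_to_1}")
```

Other expected ratios, for example 1:1 for a cross of a hemizygous plant to a non-transgenic inbred, can be tested by changing the ratio argument.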

The commercial value of tryptophan overproducer soybeans, cereals and other plants is greatest if many different hybrid combinations are available for sale. The farmer typically grows more than one kind of hybrid based on such differences as maturity, standability or other agronomic traits. Additionally, hybrids adapted to one part of the country are not adapted to another part because of differences in such traits as maturity, disease, and insect resistance. Because of this, it is necessary to breed tryptophan overproduction into a large number of parental inbred lines so that many hybrid combinations can be produced.

A conversion process (backcrossing) is carried out by crossing the original overproducer line to normal elite lines and crossing the progeny back to the normal parent. Subsequent to the backcrossing, the new overproducer lines and the appropriate combinations of lines which make good commercial hybrids are evaluated for overproduction as well as a battery of important agronomic traits. Overproducer lines and hybrids are produced which are true to type of the original normal lines and hybrids. This requires evaluation under a range of environmental conditions where the lines or hybrids will generally be grown commercially. For production of high tryptophan soybeans, it may be necessary that both parents of the hybrid seed be homozygous for the high tryptophan character. Parental lines of hybrids that perform satisfactorily are increased and used for hybrid production using standard hybrid seed production practices.
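The expected progress of such a conversion can be estimated with a simple calculation. The sketch below gives the average fraction of the genome expected to derive from the recurrent (normal elite) parent after a given number of backcrosses, assuming no selection and ignoring linkage to the selected trait; both are simplifying assumptions made for illustration.

```python
def recurrent_parent_fraction(backcrosses):
    """Expected recurrent-parent genome fraction after n backcrosses.

    n = 0 corresponds to the initial cross (F1), which carries half of its
    genome from each parent; each backcross halves the donor contribution.
    """
    if backcrosses < 0:
        raise ValueError("number of backcrosses must be non-negative")
    return 1 - 0.5 ** (backcrosses + 1)


for n in range(0, 6):
    print(f"BC{n}: {recurrent_parent_fraction(n):.4f}")
```

Actual recovery in a breeding program varies from plant to plant around this expectation.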

The transgenic plants produced herein are expected to be useful for a variety of commercial and research purposes. Transgenic plants can be created for use in traditional agriculture to possess traits beneficial to the consumer of the grain harvested from the plant (e.g., improved nutritive content in human food or animal feed). In such uses, the plants are generally grown for the use of their grain in human or animal foods. However, other parts of the plants, including stalks, husks, vegetative parts, and the like, may also have utility, including use as part of animal silage, fermentation feed, biocatalysis, or for ornamental purposes.

Transgenic plants may also find use in the commercial manufacture of proteins or other molecules, where the molecule of interest is extracted or purified from plant parts, seeds, and the like. Cells or tissue from the plants may also be cultured, grown in vitro, or fermented to manufacture such molecules.

The transgenic plants may also be used in commercial breeding programs, or may be crossed or bred to plants of related crop species. Improvements encoded by the recombinant DNA may be transferred, e.g., from soybean cells to cells of other species, e.g., by protoplast fusion.

In one embodiment, a transgene comprised of a maize anthranilate synthase α-domain isolated from a maize cell line tolerant to 5-MT and linked to the 35S CaMV promoter is introduced into a 5-MT sensitive monocot or dicot tissue using microprojectile bombardment. Transformed embryos or meristems are selected and used to generate transgenic plants. Transformed calli and transgenic plants can be evaluated for tolerance to 5-MT or 6-MA and for stable inheritance of the tolerance trait.

EXAMPLES

The following examples further illustrate the invention and are not intended to be limiting thereof. It should be appreciated by those of skill in the art that the techniques disclosed in the examples which follow represent techniques discovered by the inventors to function well in the practice of the invention, and thus can be considered to constitute preferred modes for its practice. However, those of skill in the art should, in light of the present disclosure, appreciate that many changes can be made in the specific embodiments which are disclosed and still obtain a like or similar result without departing from the concept, spirit and scope of the invention. More specifically, it will be apparent that certain agents which are both chemically and physiologically related may be substituted for the agents described herein while the same or similar results would be achieved. All such similar substitutes and modifications apparent to those skilled in the art are deemed to be within the spirit, scope and concept of the invention as defined by the appended claims.

Example 1 Isolation and E. coli Expression of Anthranilate Synthase from Agrobacterium tumefaciens

This example describes the isolation of anthranilate synthase from Agrobacterium tumefaciens and its expression in E. coli.

Cloning of Agrobacterium tumefaciens AS

The nucleotide and amino acid sequences of the anthranilate synthase coding region from Rhizobium meliloti (GenBank accession number: P15395) were used to search an Agrobacterium tumefaciens C58 genomic sequence database (Goodner et al., 2001). The search was performed with tblastn using the BLOSUM62 matrix (Altschul et al., 1997).

The identified AS homolog in the Agrobacterium tumefaciens C58 genomic sequence database was cloned by PCR using genomic DNA from Agrobacterium tumefaciens strain C58 (ATCC No. 33970) as the template. The primary PCR reaction was carried out using the following primers:

5′-TTATGCCGCCTGTCATCG-3′ (SEQ ID NO: 47); and
5′-ATAGGCTTAATGGTAACCG-3′ (SEQ ID NO: 48).

Gene amplification parameters were as follows: (a) denature at 95° C. for 30 seconds, (b) anneal at 50° C. for 30 seconds and (c) extend at 72° C. for 2 minutes, using Expand high fidelity PCR (Roche Biochemicals), according to the manufacturer's directions.
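As a rough check of the primary PCR primers against the 50° C. annealing step, the sketch below computes primer length, GC fraction and an approximate melting temperature. The Wallace rule used here (2° C. per A or T, 4° C. per G or C) is a common rule of thumb for short oligonucleotides; it is offered as an illustrative estimate and is not stated to be the method used to design these primers.

```python
PRIMERS = {
    "SEQ ID NO: 47": "TTATGCCGCCTGTCATCG",
    "SEQ ID NO: 48": "ATAGGCTTAATGGTAACCG",
}


def gc_fraction(seq):
    """Fraction of bases in the sequence that are G or C."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def wallace_tm(seq):
    """Approximate melting temperature (degrees C) by the Wallace rule."""
    seq = seq.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


for name, seq in PRIMERS.items():
    print(f"{name}: {len(seq)} nt, GC {gc_fraction(seq):.0%}, "
          f"Tm about {wallace_tm(seq)} C")
```

Annealing temperatures are often set a few degrees below the estimated melting temperature, so an estimate in the mid-50s is consistent with annealing at 50° C.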

An additional round of PCR amplification, yielding a product of approximately 2.3 Kb in length, was carried out using the amplified template from above and the following nested primers:

5′-CTGAACAACAGAAGTACG-3′ (SEQ ID NO: 49); and
5′-TAACCGTGTCATCGAGCG-3′ (SEQ ID NO: 50).

The purified PCR product was ligated into pGEM-T easy (Promega Biotech, Madison, Wis.) resulting in the plasmid pMON61600. pMON61600 was sequenced using standard sequencing methodology. Confirmation of the correct sequence was obtained by comparison of the sequence with the Rhizobium meliloti anthranilate synthase sequence (FIG. 1). The translated amino acid sequence from the isolated clone (SEQ ID NO: 4) shared 88% identity with the Rhizobium meliloti enzyme (SEQ ID NO: 7) (FIG. 1).

The abbreviation “AgroAS” or A. tumefaciens AS is sometimes used herein to refer to Agrobacterium tumefaciens anthranilate synthase.

E. coli Expression of Agrobacterium tumefaciens AS

The following vectors were constructed to facilitate subcloning of the Agrobacterium tumefaciens AS gene into a suitable expression vector.

A 2215 base pair PCR fragment was generated using pMON61600 as the template and the following primers:

5′-AAAAAGATCTCCATGGTAACGATCATTCAGG-3′ (SEQ ID NO: 51); and
5′-AAAAGAATTCTTATCACGCGGCCTTGGTCTTCGCC-3′ (SEQ ID NO: 52).

The plasmid pMON61600 was digested with restriction enzymes NcoI and RsrII. A 409 bp fragment (derived by digesting the 2215 base pair PCR product with NcoI and RsrII) was then ligated into the digested pMON61600 plasmid, thereby replacing the NcoI/RsrII fragment and resulting in an NcoI site in frame with the translation initiation codon (ATG) of Agrobacterium tumefaciens AS, to yield plasmid pMON34692.
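The placement of the NcoI site relative to the initiation codon can be seen directly in the forward primer (SEQ ID NO: 51). The sketch below locates the NcoI recognition sequence (CCATGG) in the primer and reads codons starting from the ATG contained within it. It is an illustrative check on the primer sequence only; the helper function name is hypothetical.

```python
def find_sites(seq, site):
    """Return every 0-based position at which site occurs in seq."""
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start)
        start = seq.find(site, start + 1)
    return positions


FORWARD_PRIMER = "AAAAAGATCTCCATGGTAACGATCATTCAGG"  # SEQ ID NO: 51
NCOI_SITE = "CCATGG"  # NcoI recognition sequence

sites = find_sites(FORWARD_PRIMER, NCOI_SITE)

# The ATG lies inside the NcoI site, two bases after its first C.
atg_start = sites[0] + NCOI_SITE.index("ATG")

# Read complete codons in frame with that ATG to the end of the primer.
codons = [
    FORWARD_PRIMER[i:i + 3]
    for i in range(atg_start, len(FORWARD_PRIMER) - 2, 3)
]
print(sites, codons)
```

Because the ATG is part of the recognition sequence itself, cutting and religating at this NcoI site keeps a downstream coding sequence in frame with the initiation codon.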

The base T7 E. coli expression plasmid, pMON34697, was generated by restriction digestion of pET30a (Novagen, Inc., Madison, Wis.) with SphI and BamHI. The resulting 4,969 bp fragment was purified and subcloned with a 338 bp SphI and BamHI fragment from pET11d (Novagen, Inc.).

The plasmid pMON34705 (FIG. 2) was generated by restriction digestion of pMON34697 with NcoI and SacI. The resulting 5,263 bp fragment was then purified and ligated with a 2,256 bp NcoI and SacI fragment from pMON34692 containing Agrobacterium tumefaciens AS.

The plasmid pMON34705 was transformed into E. coli BL21(DE3) (F⁻ ompT hsdS_B (r_B⁻ m_B⁻) gal dcm (DE3)) according to the manufacturer's instructions (Novagen, Inc.). DE3 is a host lysogen of λDE3 containing a chromosomal copy of the T7 RNA polymerase gene under control of an isopropyl-1-thio-D-galactopyranoside (IPTG) inducible lacUV5 promoter.

Transformed cells were selected on kanamycin plates that had been incubated at 37° C. overnight (10 hours). Single colonies were transferred to 2 ml of LB (Luria Broth; per liter, 10 g tryptone, 5 g yeast extract, 10 g NaCl, and 1 g glucose (optional)) or 2X-YT broth (per liter, 16 g tryptone, 10 g yeast extract, 5 g NaCl) and then placed in a 37° C. incubator and shaken at 225 rpm for 3 hours. The cultures were then removed, 4 μL of 100 mM IPTG was added to each culture, and the cultures were returned to the 37° C. incubator for an additional 2 to 3 hours. A 1 mL aliquot of the cells was removed and sonicated in sonication buffer (50 mM potassium phosphate (pH 7.3), 10% glycerol, 10 mM 2-mercaptoethanol and 10 mM MgCl₂). The resulting lysed cell extract was the source material for the standard AS assay described below. The results established that the expression system based on plasmid pMON34705 was able to produce soluble and enzymatically active Agrobacterium tumefaciens AS protein that accounted for approximately 50% of total soluble extracted protein.
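For reference, the final IPTG concentration implied by the induction step above can be worked out as a simple dilution. The sketch assumes the full 2 ml culture volume is still present when the IPTG is added, which the text does not state explicitly.

```python
def final_concentration_mM(stock_mM, stock_uL, culture_mL):
    """Final concentration after adding a small stock volume to a culture."""
    stock_mL = stock_uL / 1000.0
    # Moles added divided by the combined volume.
    return stock_mM * stock_mL / (culture_mL + stock_mL)


# 4 microliters of 100 mM IPTG added to a 2 ml culture.
iptg_mM = final_concentration_mM(stock_mM=100, stock_uL=4, culture_mL=2)
print(f"final IPTG concentration: about {iptg_mM:.2f} mM")
```

The result is approximately 0.2 mM IPTG.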

Example 2 High Trp Seed Levels are Achieved by Transformation of Plants with Wild Type Agrobacterium Anthranilate Synthase

Expression Vector pMON58120

The vector pMON58120 (FIG. 26) encodes a fusion between a 264 base pair Arabidopsis small subunit (SSU) chloroplast targeting peptide (CTP, SEQ ID NO: 71) and a 2187 base pair wild type Agrobacterium anthranilate synthase (AgroAS) open reading frame (SEQ ID NO: 1) (see Stark et al., 1992). Expression of this open reading frame is driven by the soy 7S alpha prime (7Sα′) promoter.

Upon translation on cytoplasmic ribosomes, the fusion (immature protein) is imported into the chloroplast, where the chloroplast targeting sequence is removed. There are two cleavage sites in the CTP1. The first site is 30 base pairs upstream of the CDS start (C/M), and the other is at the initial methionine (C/M). The second cleavage site does not seem to be processed efficiently. The cleavage is predicted to yield a mature protein of about 70 Kd that has AS activity as shown by enzyme activity data and trp efficacy data.

The AS gene was transformed together with the synthetic CP4 gene that confers glyphosate resistance; however, the CP4 gene is processed separately from the AS gene. Expression of the CP4 gene was driven by the FMV promoter, which is a 35S promoter from Figwort Mosaic Virus. Glyphosate resistance allows for selection of the transformed plants.

Western Analysis of AS Protein

Thirty-five transformation events of pMON58120 were analyzed for AgroAS protein presence. AgroAS protein was detected with a polyclonal antibody raised in rabbits against purified His-tagged AgroAS. The His-tagged, full-length Agro-AS polypeptide was used as an antigen to generate a population of polyclonal antibodies in rabbits by CoCalico Biological, Inc. The recombinant His-tagged Agro-AS DNA was placed into a pMON 34701 (pet-30a-agroAS) expression vector. The His-AgroAS fusion protein was expressed in E. coli BL21(DE3) and purified by Ni-NTA resin system (Qiagen protocol). For western analysis, primary rabbit anti-AgroAS antibodies were used at 1:5,000 dilution. Secondary, goat anti-rabbit alkaline phosphatase-conjugated antibodies were used at 1:5,000 dilution. In transgenic lines carrying 7Salpha′-AgroAS genes, western blot analysis consistently revealed the presence of a single band that specifically cross-reacted with anti-AgroAS antibodies. This band was not detected in the nontransgenic control line.

Free Amino Acid Analysis of Soy and Arabidopsis Seed

Amino Acid Extraction: About 50 mg of crushed soy seed (5 mg of Arabidopsis) material was placed in each centrifuge vial. One milliliter of 5% trichloroacetic acid was added to each sample (100 μl for Arabidopsis). The samples were vortexed, and allowed to sit, with agitation, at room temperature for 15 min. They were then microcentrifuged for 15 min at 14000 rpm. Some of the supernatant was then removed, placed in a HPLC vial and sealed. Samples were kept at 4° C. in the analysis queue.

Amino Acid Analysis: The reagents utilized for amino acid analysis included the OPA reagent (o-phthalaldehyde and 3-mercaptopropionic acid in borate buffer (Hewlett-Packard, PN 5061-3335)), where the borate buffer was 0.4 N in water, pH 10.2. The analysis was performed using the Agilent 1100 series HPLC system as described in the Agilent Technical Publication, “Amino Acid Analysis Using Zorbax Eclipse-AAA Columns and the Agilent 1100 HPLC”, Mar. 17, 2000. First, 0.5 μl of the sample was derivatized with 2.5 μl of OPA reagent in 10 μl of borate buffer. Second, the derivative was injected onto an Eclipse XDB-C18 5 μm, 4.6×150 mm column using a flow rate of 1.2 ml/min. Amino acid concentrations were measured using fluorescence: excitation at 340 nm, emission at 450 nm. Elution was with a gradient of HPLC Buffers A and B according to Table A, where HPLC Buffer A was 40 mM Na₂HPO₄, pH 7.8, and HPLC Buffer B was Methanol:Acetonitrile:Water (9:9:2).

TABLE A Amino Acid Elution

Time          0     20    21    26    27
% Buffer B    5     65    100   100   100
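The gradient in Table A can be read as a piecewise program. The following is a minimal sketch that returns the percentage of Buffer B at an arbitrary time; it assumes linear ramps between the tabulated points and assumes the time axis is in minutes, neither of which is stated in the table. The function name `percent_b` is illustrative only.

```python
from bisect import bisect_right

# Gradient program from Table A as (time, % Buffer B) pairs.
# Assumption: time is in minutes (the table does not state units).
GRADIENT = [(0, 5), (20, 65), (21, 100), (26, 100), (27, 100)]

def percent_b(t, program=GRADIENT):
    """Return % Buffer B at time t, assuming linear ramps between table points."""
    times = [point[0] for point in program]
    if t <= times[0]:
        return float(program[0][1])
    if t >= times[-1]:
        return float(program[-1][1])
    i = bisect_right(times, t)  # index of the first table point later than t
    (t0, b0), (t1, b1) = program[i - 1], program[i]
    return b0 + (b1 - b0) * (t - t0) / (t1 - t0)

if __name__ == "__main__":
    for t in (0, 10, 20, 20.5, 27):
        print(t, percent_b(t))
```

Under these assumptions the composition at the midpoint of the first ramp (time 10) is 35% Buffer B.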

Amino acid standards were prepared from the dry chemicals, using all amino acids of interest. Proline analysis required an additional derivatization step with 9-fluorenylmethyl-chloroformate (FMOC). Amino acid standards were also sometimes purchased in concentrations ranging from 0 to 100 μg/ml. Samples were reported in μg/g of seed powder.
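Because samples are reported in μg/g of seed powder, an HPLC reading taken on the extract has to be scaled by the extraction volume and the sample mass. The sketch below shows that arithmetic for the soy and Arabidopsis extraction conditions given above. The 10 μg/ml extract concentration is a hypothetical input, and the calculation assumes the tissue contributes negligible volume to the extract.

```python
def ug_per_g(conc_ug_per_ml, extract_volume_ml, sample_mass_mg):
    """Convert an extract concentration (ug/ml) to ug per g of seed powder."""
    mass_g = sample_mass_mg / 1000.0
    return conc_ug_per_ml * extract_volume_ml / mass_g

# Extraction conditions from the text:
#   soy:         about 50 mg powder in 1 ml of 5% trichloroacetic acid
#   Arabidopsis: about 5 mg powder in 100 ul of 5% trichloroacetic acid
# Hypothetical reading of 10 ug/ml tryptophan in each extract:
soy_ppm = ug_per_g(10.0, 1.0, 50.0)
arabidopsis_ppm = ug_per_g(10.0, 0.1, 5.0)

if __name__ == "__main__":
    print(soy_ppm, arabidopsis_ppm)
```

Both protocols use the same tissue-to-solvent ratio (50 mg per ml), so the same extract concentration corresponds to the same μg/g value in either species.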

Expression of Wild Type Agrobacterium Anthranilate Synthase in Arabidopsis

The vector pMON58120 was transformed into Arabidopsis plants by vacuum infiltration of the secondary inflorescences, and plants were allowed to set transgenic seed. The seed was collected and screened for the presence of a selectable marker (glyphosate resistance). Glyphosate resistant plants were grown to maturity, and seed from each plant, which was designated a transformation event, was analyzed for tryptophan content (Table B). Selected transformation events were also analyzed for the presence of the expressed Agrobacterium anthranilate synthase protein in the mature seed by Western blot analysis as shown in Table B.

TABLE B Analysis of Transformants

Transformation Event    Trp (ppm)    Protein present
7317                    2547         +
7315                    2960         +
7319                    3628         +
7313                    3979         +

Expression of Wild Type Agrobacterium Anthranilate Synthase in Soy (Glycine max)

Thirty-three out of thirty-five soy transformation events analyzed had an increase in seed trp levels, for example, from above 500 ppm and up to 12,000 ppm. In nontransgenic soy seeds, the trp level is less than 200 ppm. All seeds that contained high amounts of trp demonstrated anthranilate synthase protein expression by western blotting. Table C presents data for nineteen soy events that contain high trp levels and also are positive for anthranilate synthase protein by western blot analysis.

TABLE C Correlation between the Presence of the AgroAS Protein and Tryptophan Levels in Nineteen Soy Transgenic Events bearing pMON58120

Pedigree        Trp max (ppm)    Trp average (ppm)    Protein present?
A3244 (ctr)     306              96                   NO
GM_A20380:@.    6444             2246.4               YES
GM_A20532:@.    6055             2556.6               YES
GM_A22043:@.    10422            2557.2               YES
GM_A20598:@.    8861             2859.9               YES
GM_A20744:@.    7121             3373.3               YES
GM_A20381:@.    6392             3572.9               YES
GM_A20536:@.    9951             3581.5               YES
GM_A20510:@.    8916             3592.7               YES
GM_A20459:@.    8043             3900.4               YES
GM_A20337:@.    7674             4088.6               YES
GM_A20533:@.    9666             4183.2               YES
GM_A20577:@.    6276             4434.1               YES
GM_A20339:@.    9028             4687.8               YES
GM_A20386:@.    8487             5285.3               YES
GM_A20457:@.    11007            5888.9               YES
GM_A20379:@.    7672             6416.1               YES
GM_A20537:@.    9163             6695.8               YES
GM_A20534:@.    12676            7618.2               YES
GM_A20576:@.    10814            7870.1               YES

The AgroAS Enzyme Assay

The specific activity of anthranilate synthase was measured in eleven transformation events carrying the pMON58120 construct. Individual soybean immature seeds were analyzed using an HPLC-based end-point assay based on the method described by Paulsen (1991). Briefly, desalted extracts were generated from individual seeds in grinding buffer (100 mM Tris pH 7.5, 10% glycerol, 1 mM EDTA, 1 mM DTT) and incubated for 30 min with reaction buffer (100 mM Tris pH 7.5, 1 mM chorismate, 20 mM glutamine, and 10 mM MgCl₂). AgroAS activity was measured in the presence or absence of 25 μM trp. The reaction was stopped with phosphoric acid and the amount of anthranilate formed was quantified by HPLC using a fluorescence detector set at 340 nm/excitation and 410 nm/emission.

The specific activity of AS in immature segregating transgenic seeds ranged from a 1.5-fold up to a 70-fold increase compared to a nontransgenic control, reaching as high as 6,000 pmoles/mg/min. As shown in the last column of Table D, the anthranilate synthase activity in transgenic plants is resistant to tryptophan inhibition.

TABLE D AgroAS Enzyme Activity in Transgenic Event 20576

Event      Seed No.    Specific Activity (pmoles/mg/min)    Specific Activity (pmoles/mg/min) (+25 micromolar Trp)
Control    3244-1      95.4                                 42.4
Control    3244-2      85.5                                 40.6
20576      20576-1     6060.2                               4407.1
20576      20576-2     3783.8                               1709.4
20576      20576-3     2768.3                               2431.7
20576      20576-4     4244.08                              2125.2
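The fold increase over control and the fraction of activity retained in the presence of tryptophan can be recomputed directly from the Table D values. The sketch below does so; using the mean of the two control seeds as the baseline is an assumption made here for illustration, not a procedure stated in the text.

```python
# Table D values: (seed, specific activity, specific activity with 25 micromolar Trp),
# both in pmoles/mg/min.
TABLE_D = [
    ("3244-1", 95.4, 42.4),
    ("3244-2", 85.5, 40.6),
    ("20576-1", 6060.2, 4407.1),
    ("20576-2", 3783.8, 1709.4),
    ("20576-3", 2768.3, 2431.7),
    ("20576-4", 4244.08, 2125.2),
]

# Assumption: baseline is the mean of the two control seeds (3244-1 and 3244-2).
controls = [row for row in TABLE_D if row[0].startswith("3244")]
control_mean = sum(row[1] for row in controls) / len(controls)

# Fold increase in specific activity relative to the control baseline.
fold_over_control = {seed: act / control_mean for seed, act, _ in TABLE_D}

# Fraction of activity remaining when tryptophan is present.
residual_fraction = {seed: with_trp / act for seed, act, with_trp in TABLE_D}

if __name__ == "__main__":
    for seed, _, _ in TABLE_D:
        print(f"{seed}: {fold_over_control[seed]:.1f}-fold, "
              f"{100 * residual_fraction[seed]:.0f}% retained with Trp")
```

With this baseline, seed 20576-1 comes out at roughly 67-fold over control, which is in line with the "up to 70-fold" range quoted above.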

Example 3 Soybean Transformation with a Vector Containing a Maize Anthranilate Synthase α-Subunit Gene

The coding sequence for a maize anthranilate synthase α-subunit was isolated from pMON52214 by digesting with XbaI in combination with a partial NcoI digest (see Anderson et al., U.S. Pat. No. 6,118,047). The resulting 1952 bp DNA fragment representing the anthranilate synthase α coding region was gel purified, and the ends were made blunt. The plasmid pMON53901 was digested with BglII and EcoRI, to generate a 6.8 Kb fragment. After isolation, the ends of the 6.8 Kb fragment were made blunt and dephosphorylated. The 1952 bp fragment containing the ASα gene was then ligated into the blunt-ended 6.8 Kb pMON53901 fragment to generate pMON39324, a maize 7S promoter-maize ASα-NOS 3′ UTR expression vector.

This pMON39324, a maize 7S promoter-maize ASα-NOS 3′ UTR cassette, was subsequently digested with BamHI resulting in a 2.84 Kb DNA fragment, containing the 7S promoter and maize ASα coding sequence. The plasmid pMON39322 was digested with BamHI resulting in a 5.88 kb DNA fragment. These two fragments were then ligated together to create pMON39325 (FIG. 8), a transformation vector containing 7S promoter-maize ASα-NOS 3′ UTR cassette subcloned into pMON39322.

Using similar procedures, the coding sequence for a maize anthranilate synthase α-subunit was cloned downstream from the USP promoter to generate a pMON58130 expression vector, downstream from the Arc5 promoter to generate a pMON69662 expression vector, downstream from the Lea9 promoter to generate a pMON69650 expression vector, and downstream from the Per1 promoter to generate a pMON69651 expression vector. A list with these expression vectors is presented in Table E.

TABLE E C28-Maize Anthranilate Synthase Constructs

Seed Generation    Expression Cassette    Vector Name
R4                 7Sα′-maize ASα         pMON39325
R2                 Napin-maize ASα        pMON58023
R1                 USP-maize ASα          pMON58130
R1                 Arc5-maize ASα         pMON69662
R1                 Lea9-maize ASα         pMON69650
R1                 Per1-maize ASα         pMON69651

These vectors were used for plant transformation and propagation experiments. Soybean plants were transformed with the maize AS-containing vectors using the microprojectile bombardment technology as described herein. Several transgenic soybean lines were established for each type of vector and propagated through the number of generations indicated in Table E.

For example, three homozygous lines were established that carried the 7Sα′-maize AS transgene from pMON39325. These three lines were grown in a randomized block design in two different locations. Mature seed was produced and analyzed for free amino acid content. Controls were included to establish baseline trp levels, i.e. the three corresponding negative isolines and the nontransgenic controls.

Table F provides R4 seed tryptophan in ppm for pMON39325 transformant and control lines, showing that the average non-transgenic soybeans contain about 100-200 μg tryptophan/g seed powder whereas the pMON39325 transformants contain substantially more Trp. See also, FIG. 9.

TABLE F Trp Levels in seeds of Soybean Plants Transformed with the C28 Zea mays mutant (pMON39325)

Positive isoline number    Average trp of Positive Isoline (ppm)    Standard deviation    Average trp of corresponding Negative isoline (ppm)    Standard deviation
39325-1                    3467                                     377                   226                                                    55
35325-2                    2623                                     307                   164                                                    20
35325-3                    3715                                     152                   184                                                    64
35325-4                    2833                                     165                   202                                                    146
35325-5                    3315                                     161                   173                                                    34
35325-6                    2394                                     318                   144                                                    22

Nontransgenic controls     Average trp (ppm)    Standard deviation
nontransgenic control-7    191                  24
nontransgenic control-8    118                  23

Five other constructs, expressing the C28 maize anthranilate synthase under the control of five different promoters (Table E) were transformed into soy and transgenic plants were obtained. Each construct generated events high in trp. An example illustrating events generated by Per1-C28 maize anthranilate synthase is shown in Tables G and H.

TABLE G C28 maize AS Protein Expression Correlates with Increased Trp Levels in Three Transgenic Events bearing Per1-C28 maize AS (pMON69651)

Pedigree    Trp average (ppm)    Protein present?
Control     96                   No
22689       2375                 Yes
22787       1707                 Yes
22631       1116                 Yes

Table H illustrates the enzymatic activity of C28 maize AS in R1 seeds from soybean plants transformed with the pMON69651 expression vector.

TABLE H Specific Activity of C28 maize AS in R1 Seeds of pMON69651 Transformants

Event      Seed number    Specific activity (pmoles/mg/min)    Specific activity (pmoles/mg/min) (+25 micromolar tryptophan)
Control                   51.6                                 2.6
22689      22689-1        130.9                                64.7
           22689-2        115.3
           22689-3        148.5                                61.1
           22689-4        149.5
           22698-5        133.8                                60.3

These results indicate that there is a substantial increase in tryptophan when soybean plant tissues are transformed with the C28 maize AS gene.

The high trp levels shown in Table G correlate with the presence of the AS protein and with increased specific activity (2.5 fold higher than in nontransgenic controls) for the transgenic enzyme (Table H). As shown in Table H—and as predicted by the biochemical properties of the C28 maize AS enzyme—the specific activity of transgenic events is tryptophan-resistant.

Example 4 Rational Design of Agrobacterium tumefaciens Anthranilate Synthase Tryptophan Feedback Insensitive Mutants

This example describes vectors containing mutant Agrobacterium tumefaciens anthranilate synthase enzymes that have various degrees of sensitivity or insensitivity to feedback inhibition by tryptophan or tryptophan analogs.

Generation of Agrobacterium tumefaciens Mutant Anthranilate Synthase Genes

Using protein structural information from Sulfolobus solfataricus anthranilate synthase as a guide (Knochel et al., 1999), several Agrobacterium tumefaciens anthranilate synthase mutants were rationally designed utilizing protein informatics to confidently assign several residues involved in tryptophan binding. This was accomplished by alignment of the Agrobacterium tumefaciens anthranilate synthase sequence with the anthranilate synthase amino acid sequence from Sulfolobus solfataricus (FIG. 3). The putative tryptophan binding and catalysis regions of the Agrobacterium tumefaciens enzyme were assigned by combining the knowledge of the structural information with the sequence homology. Residues in the binding pocket were identified as potential candidates for altering to provide resistance to feedback inhibition by tryptophan.

The structural analysis of the Sulfolobus solfataricus anthranilate synthase enzyme suggested that amino acids E30, S31, I32, S42, V43, N204, P205, M209, F210, G221, and A373 were involved in tryptophan binding. Based on the pairwise alignment, N204, P205, and F210 of Sulfolobus solfataricus were also conserved in the monomeric Agrobacterium tumefaciens anthranilate synthase as residues N292, P293, and F298, respectively.

However, due to multiple insertions and deletions, the N-terminal regions of the Sulfolobus solfataricus and Agrobacterium tumefaciens enzymes were highly divergent. For this reason, it was necessary to manually assign residues at the N-terminal region of the Agrobacterium tumefaciens anthranilate synthase involved in tryptophan regulation (FIG. 3). Structural analysis indicated that the motif “LLES” formed a β sheet in the tryptophan-binding pocket. This structure appeared to be highly conserved among the heterotetrameric enzymes. The known monomeric enzymes were then manually aligned to the Sulfolobus solfataricus sequence using the “LLES” motif as a landmark (FIG. 7). Based on this protein informatics analysis, amino acid residues V48, S50, S51, and N52 in Agrobacterium tumefaciens AS were also likely to be involved in tryptophan binding.

With the putative tryptophan binding residues assigned in the Agrobacterium tumefaciens monomeric enzyme, several distinct strategies were rationalized for reducing the sensitivity of the enzyme to tryptophan inhibition. These substitutions included, for example, enlarging the tryptophan-binding pocket (F298A), narrowing the binding pocket (V48F, V48Y, S51F, S51C, N52F, F298W), increasing the polarity of the binding pocket (S50K), or distorting the shape of the binding pocket by changing the protein main chain conformation (P293A, P293G).

A. tumefaciens AS Site-Directed Mutagenesis

Site directed mutagenesis was used to generate ten single amino acid substitutions at six sites. The mutations were introduced into the Agrobacterium tumefaciens AS in pMON34705 using the QuikChange™ Site-Directed Mutagenesis Kit (Stratagene, Cedar Creek, Tex.). The primers used for site directed mutagenesis were SEQ ID NOs: 9-42 (FIG. 4; F=forward, R=reverse). Each primer sequence is specific for alteration of the nucleic acid at a specific location in the sequence, thus changing the encoded codon to code for a new amino acid. For example, S51C designates a change from serine to cysteine at amino acid position 51 in the Agrobacterium tumefaciens AS peptide sequence.

Following mutagenesis the sequence of the entire gene was reconfirmed and the variants expressed and purified from E. coli as described below for the wild type enzyme. The resultant plasmids comprising mutant Agrobacterium tumefaciens AS are suitably cloned into a plasmid for overproduction of protein using the T7 expression system as described in Example 1.

Agrobacterium tumefaciens AS Protein Expression and Purification

Agrobacterium tumefaciens AS wild type and mutant enzymes were expressed in E. coli as described in Example 1. The purification of all the Agrobacterium tumefaciens AS enzymes, including wild type and mutants thereof, was performed at 4° C. The cells (approximate wet weight of 1 g) were suspended in 20 ml of purification buffer (50 mM potassium phosphate, pH 7.3, 10 mM MgCl₂, 10 mM 2-mercaptoethanol, 10% glycerol) and lysed by ultrasonication (Branson sonifier Cell Disruptor, W185). Supernatant was collected after centrifugation of the homogenate at 20,000×g for 15 min. The supernatant was subjected to ammonium sulfate fractionation (30 to 65% saturation). The precipitate was collected after centrifugation at 20,000×g for 15 min and dissolved in 3 ml of the purification buffer and then loaded as a whole on an Econo-Pac 10DG desalting column, pre-equilibrated with the same buffer. Fractions containing the enzyme were detected by the developed assay and pooled. The pooled enzyme (4.3 mls) was loaded on a 10 ml DEAE Sephacel (Pharmacia Biotech) column (1.5×7.5 cm) equilibrated with the same buffer. The column was washed with 30 ml of the purification buffer and the enzyme was eluted with 30 ml of 50 mM NaCl in the same buffer. Fractions containing high AS activity were pooled and precipitated by 65% ammonium sulfate saturation and isolated and desalted as above. Fractions containing the enzyme were pooled and stored at −80° C.

Anthranilate Synthase Enzyme Assay and Kinetic Analysis

The standard assay for Agrobacterium tumefaciens AS was performed at 25° C. in an assay buffer containing 100 mM potassium phosphate, pH 7.0, 10 mM MgCl₂, 1 mM dithiothreitol, 200 μM chorismate and 10 mM L-glutamine. The reaction was started by adding 30 μl of enzyme to the reaction mixture and mixing. The formation of anthranilate was directly monitored by the absorbance increase at 320 nm for 3 minutes. The initial rate of reaction was calculated as unit absorbance increase per second based on the slope of the absorbance change over the reaction time. The K_(m) for chorismate (K_(m) ^(Cho)) was determined in a total volume of 1 ml assay buffer containing 100 mM potassium phosphate, pH 7.0, 10 mM MgCl₂, 1 mM dithiothreitol with 10 mM L-glutamine and varying the concentration of chorismate between 2.5-100 μM. The K_(m) for glutamine (K_(m) ^(Gln)) was determined in a total volume of 1 ml assay buffer containing 100 mM potassium phosphate, pH 7.0, 10 mM MgCl₂, 1 mM dithiothreitol with 200 μM chorismate and varying the concentration of L-glutamine between 0.1-2 mM. The IC₅₀ for tryptophan (IC₅₀ ^(Trp)) was determined in a total volume of 1 ml assay buffer containing 100 mM potassium phosphate, pH 7.0, 10 mM MgCl₂, 1 mM dithiothreitol, 10 mM L-glutamine, 200 μM chorismate and varying the concentration of L-tryptophan between 0.1-10 mM. Kinetic parameters and IC₅₀ of AS were calculated after fitting the data using a non-linear regression program (GraFit; Erithacus Software: Surrey, UK).
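To illustrate how K_(m) is extracted from initial-rate data of this kind, the following sketch fits Michaelis-Menten data with a Hanes-Woolf linearization ([S]/v against [S]). This is a swapped-in, simpler technique and not the non-linear regression performed in GraFit. The rates are synthetic, generated from the wild type K_(m) ^(Cho) of 8.0 μM in Table I with an arbitrary V_max of 1.0; the function names are illustrative.

```python
def michaelis_menten(s, vmax, km):
    """Initial rate at substrate concentration s under Michaelis-Menten kinetics."""
    return vmax * s / (km + s)

def hanes_woolf_fit(substrate, rates):
    """Estimate (Km, Vmax) from a least-squares line through ([S], [S]/v).

    In the Hanes-Woolf form, [S]/v = [S]/Vmax + Km/Vmax, so
    slope = 1/Vmax and intercept = Km/Vmax.
    """
    xs = list(substrate)
    ys = [s / v for s, v in zip(substrate, rates)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept / slope, 1.0 / slope

# Chorismate concentrations spanning the 2.5-100 uM range used in the assay.
chorismate_uM = [2.5, 5.0, 10.0, 25.0, 50.0, 100.0]
# Synthetic, noise-free rates (assumed Vmax = 1.0 in arbitrary units, Km = 8.0 uM).
synthetic_rates = [michaelis_menten(s, vmax=1.0, km=8.0) for s in chorismate_uM]

km_est, vmax_est = hanes_woolf_fit(chorismate_uM, synthetic_rates)

if __name__ == "__main__":
    print(f"Km = {km_est:.2f} uM, Vmax = {vmax_est:.2f}")
```

With real, noisy measurements a direct non-linear fit, as used in the text, is generally preferred because linearizations distort the error structure of the data.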

Several mutants demonstrated reduced sensitivity to tryptophan inhibition while still maintaining enzymatic activity comparable to the wild type enzyme (Table I). These results demonstrate that the extent of sensitivity to tryptophan inhibition can be decreased, for example, by mutating amino acids in the tryptophan-binding pocket of anthranilate synthase and by optimizing the mutations that demonstrate feedback insensitivity.

TABLE I Anthranilate Synthase Activity and Effect of Tryptophan on Agrobacterium tumefaciens AS Mutants

Mutation    Codon    K_(m) ^(Cho) (μM)    K_(m) ^(Gln) (mM)    k_(cat) (s⁻¹)    k_(cat)/K_(m) ^(Cho) (μM⁻¹s⁻¹)    IC₅₀ ^(Trp) (μM)
WT                   8.0                  0.11                 0.43             5.37 × 10⁻²                       5
V48F        TTT      4.5                  0.08                 0.24             5.33 × 10⁻²                       150
V48Y        TAT      4.2                  0.10                 0.18             4.28 × 10⁻²                       650
S50K        AAG      13                   0.01                 0.13             1.00 × 10⁻²                       0.1
S51F        TTC      10                   0.06                 0.08             0.80 × 10⁻²                       >32,000
S51C        TGC      2.8                  0.08                 0.15             5.36 × 10⁻²                       1,500
N52F        TTC      5.5                  0.04                 0.21             3.82 × 10⁻²                       41
P293A       GCG      24                   0.16                 0.35             1.46 × 10⁻²                       14
P293G       GGG      33                   0.07                 0.48             1.45 × 10⁻²                       17
F298A       GCC      9.2                  0.10                 0.46             5.00 × 10⁻²                       5.5
F298W       TGG      18                   0.14                 0.44             2.44 × 10⁻²                       450
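The catalytic efficiency column of Table I is the ratio k_(cat)/K_(m) ^(Cho), and the degree of feedback insensitivity can be expressed as the IC₅₀ ^(Trp) of a mutant relative to wild type. The sketch below recomputes both from the tabulated values. The S51F entry is reported only as a lower bound (>32,000 μM), so its fold change is likewise a lower bound; the helper names are illustrative.

```python
# Table I values per mutation: (Km_Cho in uM, kcat in 1/s, IC50_Trp in uM).
# S51F IC50 is a reported lower bound (>32,000 uM), stored here as 32000.0.
TABLE_I = {
    "WT":    (8.0, 0.43, 5.0),
    "V48F":  (4.5, 0.24, 150.0),
    "V48Y":  (4.2, 0.18, 650.0),
    "S50K":  (13.0, 0.13, 0.1),
    "S51F":  (10.0, 0.08, 32000.0),
    "S51C":  (2.8, 0.15, 1500.0),
    "N52F":  (5.5, 0.21, 41.0),
    "P293A": (24.0, 0.35, 14.0),
    "P293G": (33.0, 0.48, 17.0),
    "F298A": (9.2, 0.46, 5.5),
    "F298W": (18.0, 0.44, 450.0),
}

def efficiency(mutation):
    """Catalytic efficiency kcat/Km_Cho in uM^-1 s^-1."""
    km_cho, kcat, _ = TABLE_I[mutation]
    return kcat / km_cho

def ic50_fold(mutation):
    """IC50_Trp of the mutant divided by the wild type IC50_Trp."""
    return TABLE_I[mutation][2] / TABLE_I["WT"][2]

if __name__ == "__main__":
    for name in TABLE_I:
        print(f"{name}: kcat/Km = {efficiency(name):.4f}, IC50 fold = {ic50_fold(name):g}")
```

For example, S51C retains essentially wild type catalytic efficiency while its IC₅₀ ^(Trp) is 300 times that of wild type, and F298W shows a 90-fold increase in IC₅₀ ^(Trp) at roughly half the wild type efficiency.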

Example 5 Random Mutagenesis of Agrobacterium tumefaciens AS to Generate Tryptophan Feedback Insensitive Mutants

In addition to the rational design approaches described in Example 4, other strategies to generate feedback insensitive mutants of anthranilate synthase include, but are not limited to, random mutagenesis. Random mutagenesis of the Agrobacterium tumefaciens AS can be accomplished, for example, by chemical mutagenesis (isolated DNA or whole organism), error prone PCR, and DNA shuffling. This example describes the use of chemical mutagenesis followed by genetic selection. The genetic selection approach is also useful for selection of desirable mutants derived from other mutagenesis techniques.

Generation of E. coli Expression Plasmid Containing A. tumefaciens AS

The open reading frame from the Agrobacterium tumefaciens AS clone pMON61600 (SEQ ID NO: 1, described in Example 1) was amplified by PCR using primers that contain an NcoI site on the 5′ end of the forward primer and an XbaI site on the 3′ end of the reverse primer:

5′-CATCCCATGGATGGTAACGATCATTCAGGAT-3′ (SEQ ID NO: 55); and

5′-GATGTCTAGAGACACTATAGAATACTCAAGC-3′ (SEQ ID NO: 56).

The resulting PCR product was ligated into pMON25997, from which the bktB open reading frame (Slater et al., 1998) had been removed by digestion with BspHI and XbaI, resulting in plasmid pMON62000. pMON62000 is the base plasmid used for mutagenesis and complementation of the tryptophan auxotroph (EMG2ΔtrpE).
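As a quick check on the primer design, the sketch below searches the two primers (SEQ ID NOs: 55 and 56) for the NcoI and XbaI recognition sequences, CCATGG and TCTAGA. The helper name `find_site` is illustrative, and whitespace inside a printed sequence is treated as a typesetting artifact and removed before searching.

```python
# Recognition sequences for the two enzymes named in the text.
SITES = {"NcoI": "CCATGG", "XbaI": "TCTAGA"}

# Primer sequences copied from the text (5' to 3').
FORWARD = "CATCCCATGGATGGTAACGATCATTCAGGAT"   # SEQ ID NO: 55
REVERSE = "GATGTCTAGAGACACTATAGAATACTCAAGC"   # SEQ ID NO: 56

def find_site(primer, site):
    """Return the 0-based index of a recognition site in a primer, or -1 if absent."""
    cleaned = "".join(primer.split()).upper()  # drop any whitespace from typesetting
    return cleaned.find(site)

if __name__ == "__main__":
    print("NcoI in forward primer at index", find_site(FORWARD, SITES["NcoI"]))
    print("XbaI in reverse primer at index", find_site(REVERSE, SITES["XbaI"]))
```

Both sites sit four bases in from the 5′ end of their primer, leaving a short overhang of extra bases upstream of each site.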

Generation of an E. coli Tryptophan Auxotroph EMG2ΔtrpE

E. coli strain Ec-8 (EMG2ΔtrpE) was constructed using the suicide vector pKO3 to delete 1,383 base pairs from the chromosomal trpE gene of E. coli strain EMG2(K-12 wt F+) (E. coli Genetic Stock Center). Two amplicons from E. coli genomic DNA were PCR amplified. The first amplicon was approximately 1.5 kb and contained the first 30 bp of the trpE ORF at the 3′ end. This amplicon contains a BamH1 site at the 5′ end and an EcoR1 site at the 3′ end. The second amplicon was approximately 1 kb and contained the last 150 bp of the trpE ORF at the 5′ end. This amplicon contains an EcoR1 site at the 5′ end and a Sal1 site at the 3′ end. The two amplicons were digested with the appropriate enzymes and ligated together at the EcoR1 site to create an in-frame deletion of trpE. FIG. 10 shows the resulting sequence of the truncated gene (SEQ ID NO: 46). The trpE deletion amplicon was ligated into pKO3 at the BamH1 and Sal1 sites. Gene disruption was performed as described in Link et al. (1997).
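The deletion is described as in-frame, which requires the number of removed base pairs to be a multiple of three. The sketch below checks that arithmetic using only the figures given above. The ORF length it reports is merely what those figures imply (deleted bases plus the retained first 30 bp and last 150 bp), and the bases contributed by the EcoRI junction are not modeled.

```python
def deletion_summary(deleted_bp, kept_5prime_bp, kept_3prime_bp):
    """Summarize a deletion from the base-pair counts given in the text."""
    return {
        # ORF length implied by the three figures; not an independently sourced value.
        "implied_orf_bp": deleted_bp + kept_5prime_bp + kept_3prime_bp,
        # A deletion preserves the reading frame only if it removes whole codons.
        "in_frame": deleted_bp % 3 == 0,
        "codons_removed": deleted_bp // 3,
    }

trpE_deletion = deletion_summary(deleted_bp=1383, kept_5prime_bp=30, kept_3prime_bp=150)

if __name__ == "__main__":
    print(trpE_deletion)
```

Removing 1,383 bp corresponds to exactly 461 codons, consistent with the statement that the reading frame is preserved.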

Complementation of E. coli Tryptophan Auxotroph EMG2ΔtrpE with pMON62000

E. coli strain Ec-8 (EMG2ΔtrpE) was transformed with pMON62000 and plated on M9 minimal medium to determine if the deletion was complemented by the addition of pMON62000. A plasmid control (minus the Agrobacterium tumefaciens AS insert) and a strain control Ec-8 were also plated onto M9 minimal medium and onto M9 minimal medium with 40 μg/ml tryptophan. Growth of strain Ec-8 transformed with pMON62000 was observed on M9 without tryptophan, whereas no growth of either of the controls was observed, indicating complementation of the trpE deletion in strain Ec-8 by pMON62000.

Hydroxylamine Mutagenesis of pMON62000 and Genetic Selection of Mutants

To generate mutants of anthranilate synthase, pMON62000 was mutated with the chemical mutagen hydroxylamine. The following ingredients were combined in an eppendorf tube: 20 μg pMON62000 plasmid DNA and 40 μl 2.5 M hydroxylamine, pH 6.0. The volume was brought to 200 μl with 0.1 M NaH₂PO₄, pH 6.0 + 5 mM EDTA, pH 6.0. The tube was incubated at 70° C. After 1.5 hours, 100 μl of the reaction mixture was dialyzed on a nitrocellulose filter that was floating on approximately 500 ml H₂O. After 15 minutes, the DNA was concentrated using a Qiagen PCR Purification Kit. After 3 hours, the remaining 100 μl of the reaction mixture was removed and purified in the same manner.
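The working concentrations in the mutagenesis reaction follow from simple dilution of the stocks into the 200 μl final volume. A minimal sketch of that calculation is given below; the function name is illustrative, and the plasmid is treated as a mass rather than a molar quantity.

```python
def final_concentration(stock_conc, stock_volume_ul, final_volume_ul):
    """Concentration after diluting a stock into the final volume (C1*V1 = C2*V2)."""
    return stock_conc * stock_volume_ul / final_volume_ul

# 40 ul of 2.5 M hydroxylamine brought to 200 ul total.
hydroxylamine_M = final_concentration(2.5, 40.0, 200.0)

# 20 ug of plasmid DNA in the same 200 ul reaction.
dna_ug_per_ul = 20.0 / 200.0

if __name__ == "__main__":
    print(f"hydroxylamine: {hydroxylamine_M} M, DNA: {dna_ug_per_ul} ug/ul")
```

Under these figures the reaction contains 0.5 M hydroxylamine and 0.1 μg/μl plasmid DNA, so each 100 μl aliquot carries about 10 μg of DNA before purification.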

E. coli strain Ec-8 was then transformed by electroporation with 100 ng of pMON62000 that had been mutagenized for either 1.5 or 3 hours with hydroxylamine. Two transformation procedures were performed for each time point. Transformed cells were allowed to recover for 4 or 6 hours in SOC medium (20 g/L Bacto-Tryptone, 5 g/L Bacto Yeast Extract, 10 ml/L 1M NaCl, 2.5 ml/L 1M KCl, 18 g glucose).

Two 245 mm square bioassay plates were prepared containing M9 minimal medium, plus 2% agar, and 50 μg/ml 5-methyl-DL-tryptophan (5-MT). An aliquot of 900 μl of the 1.5 hour mutagenized transformation mixture was plated onto one 50 μg/ml 5-MT plate. The remaining 100 μl was plated onto the M9 control plate. The same procedure was performed for the transformation mixture containing the 3.0 hour mutagenized plasmid.

The plates were then incubated at 37° C. for approx. 2.5 days. Resistant colonies were isolated from the 5-MT plates and were streaked onto LB-kanamycin (50 μg/ml) plates to confirm the presence of the plasmid. All of the selected colonies grew on these plates. Individual colonies from each of the resistant clones were prepped in duplicate to isolate the plasmid. Restriction digests and PCR were performed and confirmed that all the clones contained the desired Agrobacterium tumefaciens AS insert.

The rescued plasmids were then transformed back into strain Ec-8. One colony from each transformation was purified by streaking onto new LB-Kanamycin plates. To confirm resistance to 5-MT, individual purified colonies were streaked onto plates containing M9 plus 50 μg/ml 5-MT and 2% agar, and then grown at 37° C. for 3 days. Resistance was confirmed for most of the clones. To determine if resistant mutants would remain resistant at an even higher concentration of 5-MT, they were plated onto M9 plus 300 μg/ml 5-MT and 2% Agar. Most clones demonstrated resistance at this high concentration also.

The plasmids from all of the resistant clones were isolated and sequenced on both strands. Some of the mutations from this experiment are diagrammed in Table J.

TABLE J A. tumefaciens trpEG Sequence Variations in 5-MT Resistant Clones.

Database Clone #    Original Clone #    Determined Sequence Variations               K_(m) ^(cho) (μM)    IC₅₀ ^(trp) (μM)
Wt                                                                                   8.0                  5.0
Ec-12               1                   G4A Val2Ile
Ec-18               8                   C35T Thr12Ile                                15                   2.5
Ec-19               9                   C2068T Pro690Ser                             5.0                  3.4
Ec-20               11                  G1066A Glu356Lys & C1779T Ile593Ile

As indicated by the data in Table J, several mutations had little effect on the K_(m) ^(cho) and IC₅₀ ^(trp) of the mutant enzyme, indicating that these mutations are likely not the source of resistance to tryptophan feedback inhibition. For example, the mutation of C to T at nucleotide 35, which changes a threonine residue to isoleucine at amino acid position 12 (Thr12Ile), gives rise to a minor change in K_(m) ^(cho) and IC₅₀ ^(trp) values. Similarly, a change of C to T at nucleotide position 2068, which changes a proline to a serine, also gives rise to a minor change in K_(m) ^(cho) and IC₅₀ ^(trp) values. These mutations may therefore be “silent” mutations that give rise to variant gene products having enzymatic properties like those of wild type.
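The nucleotide and amino acid coordinates in Table J can be cross-checked, since a 1-based nucleotide position in the open reading frame maps to a unique codon. The sketch below performs that mapping and also flags whether each change is a C-to-T or G-to-A transition, the type generally attributed to hydroxylamine (G-to-A being the same event read from the complementary strand). Function names are illustrative.

```python
import re

def parse_change(change):
    """Split a change such as 'C35T' into (reference base, 1-based position, new base)."""
    match = re.fullmatch(r"([ACGT])(\d+)([ACGT])", change)
    if match is None:
        raise ValueError(f"unrecognized change: {change}")
    return match.group(1), int(match.group(2)), match.group(3)

def codon_of(nt_pos):
    """Map a 1-based ORF nucleotide position to (codon number, position within codon)."""
    return (nt_pos - 1) // 3 + 1, (nt_pos - 1) % 3 + 1

def is_hydroxylamine_type(ref, alt):
    """True for C->T or G->A transitions."""
    return (ref, alt) in {("C", "T"), ("G", "A")}

# Nucleotide changes from Table J with the amino acid position reported for each.
TABLE_J = {"G4A": 2, "C35T": 12, "C2068T": 690, "G1066A": 356, "C1779T": 593}

if __name__ == "__main__":
    for change, reported_codon in TABLE_J.items():
        ref, pos, alt = parse_change(change)
        codon, offset = codon_of(pos)
        print(change, "codon", codon, "base", offset,
              "matches table:", codon == reported_codon,
              "transition:", is_hydroxylamine_type(ref, alt))
```

All five changes land in the codon reported in the table, and C1779T falls at the third base of codon 593, which fits its listing as the synonymous change Ile593Ile.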

Example 6 High Tryptophan Transgenic Soybean Plants

This example sets forth preparation of transgenic soybean plants having elevated tryptophan levels resulting from transformation with tryptophan feedback insensitive mutants of anthranilate synthase from Agrobacterium tumefaciens.

Vector Construction

Plasmid pMON34711, which harbors the anthranilate synthase clone from Agrobacterium tumefaciens containing the F298W mutation described in Example 4, was digested with restriction enzyme NotI. The ends of the resulting fragment were blunted and then digested with NcoI. The plasmid pMON13773 was then digested with restriction enzyme EcoRI, the ends blunted and then digested with NcoI. The resulting fragments were ligated resulting in plasmid pMON58044, which contained the AS gene under the control of the 7S promoter and NOS 3′ UTR.

Plasmid pMON58044 was then cut with restriction enzymes BglII and NcoI and ligated with a fragment that was generated by digesting pMON53084 with BglII and NcoI. The resulting plasmid was named pMON58045 and contained the sequence for the Arabidopsis SSU1A transit peptide.

Finally, plasmid pMON58046 (FIG. 5) was constructed by ligating the fragments generated by digesting pMON58045 and pMON38207 with restriction enzyme NotI. This resulted in the pMON58046 vector (FIG. 5) that was used for soybean transformation.

Soybean Transformation By Microprojectile Bombardment

For the particle bombardment transformation method, commercially available soybean seeds (i.e., Asgrow A3244, A4922) were germinated overnight for approximately 18-24 hours and the meristem explants were excised. The primary leaves were removed to expose the meristems and the explants were placed in targeting media with the meristems positioned perpendicular to the direction of the particle delivery.

The pMON58046 transformation vector described above was precipitated onto microscopic gold particles with CaCl₂ and spermidine and subsequently resuspended in ethanol. The suspension was coated onto a Mylar sheet that was then placed onto the electric discharge device. The particles were accelerated into the plant tissue by electric discharge at approximately 60% capacitance.

Following bombardment, the explants were placed in selection media (WPM+0.075 mM glyphosate; WPM=Woody Plant Medium (McCown & Lloyd, 1981) minus BAP) for 5-7 weeks to allow for selection and growth of transgenic shoots. Phenotype positive shoots were harvested approximately 5-7 weeks post-bombardment and placed into selective rooting media (BRM+0.025 mM glyphosate; see below for BRM recipe) for 2-3 weeks. Shoots producing roots were transferred to the greenhouse and potted in soil. Shoots that remained healthy on selection but did not produce roots were transferred to non-selective rooting media (BRM without glyphosate) for an additional 2 weeks. The roots from any shoots that produced roots off the selection were tested for expression of the plant selectable marker before transferring to the greenhouse and potting in soil. Plants were maintained under standard greenhouse conditions until R1 seed harvest.

The recipe used for Bean Rooting Medium (BRM) is provided in Table K below.

TABLE K Recipe for Bean Rooting Medium.

Compound                              Quantity for 4 L
MS Salts***                           8.6 g
Myo-inositol (cell culture grade)     0.40 g
SBRM Vitamin Stock**                  8.0 ml
L-Cysteine (10 mg/ml)                 40.0 ml
Sucrose (ultra pure)                  120 g
Adjust pH to 5.8
Washed Agar                           32 g
Additions after autoclaving:
SBRM/TSG Hormone Stock*               20.0 ml

*SBRM/TSG Hormone Stock (per 1 L of BRM): 3.0 ml IAA (0.033 mg/ml), 2.0 ml sterile distilled water. Store stock in dark at 4° C.
**SBRM Vitamin Stock (per 1 L of stock): Glycine (1.0 g), Nicotinic Acid (0.25 g), Pyridoxine HCl (0.25 g), Thiamine HCl (0.25 g).
***3× Minor MS Salts (per 1 L stock): H₂BO₃ (1.86 g), MnSO₄ (5.07 g), ZnSO₄·H₂O (2.58 g), KI (0.249 g), 7.5 μl NaMoO·2H₂O (1.0 mg/ml), 7.5 μl CoSO₄·5H₂O (1.0 mg/ml), 7.5 μl CoCl₂·6H₂O (1.0 mg/ml).

One ingredient at a time was added and dissolved, the volume was brought to one liter with sterile distilled water, and the solution was stored in a foil-covered bottle in the refrigerator for no longer than one month.
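Table K lists quantities for a 4 L batch, so preparing a different volume is a matter of proportional scaling. The sketch below is one hypothetical way to do that; it assumes every ingredient scales linearly with volume and leaves the pH adjustment step (to 5.8) outside the calculation.

```python
# Bean Rooting Medium quantities for 4 L, copied from Table K.
BRM_4L = {
    "MS Salts (g)": 8.6,
    "Myo-inositol (g)": 0.40,
    "SBRM Vitamin Stock (ml)": 8.0,
    "L-Cysteine, 10 mg/ml (ml)": 40.0,
    "Sucrose (g)": 120.0,
    "Washed Agar (g)": 32.0,
    "SBRM/TSG Hormone Stock, added after autoclaving (ml)": 20.0,
}

def scale_recipe(recipe, target_liters, base_liters=4.0):
    """Scale each ingredient linearly from the base batch size to the target volume."""
    factor = target_liters / base_liters
    return {name: quantity * factor for name, quantity in recipe.items()}

brm_1l = scale_recipe(BRM_4L, target_liters=1.0)

if __name__ == "__main__":
    for name, quantity in brm_1l.items():
        print(f"{name}: {quantity:g}")
```

For a 1 L batch this gives, for example, 30 g sucrose and 8 g washed agar.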

Soybean Transformation Using Agrobacterium tumefaciens

For the Agrobacterium transformation method, commercially available soybean seeds (Asgrow A3244, A4922) were germinated overnight (approximately 10-12 hours) and the meristem explants were excised. The primary leaves may or may not have been removed to expose the meristems and the explants were placed in a wounding vessel.

Agrobacterium strain ABI containing the plasmid of interest was grown to log phase. Cells were harvested by centrifugation and resuspended in inoculation media containing inducers. Soybean explants and the induced Agrobacterium culture were mixed no later than 14 hours from the time of initiation of seed germination and wounded using sonication.

Following wounding, explants were incubated in Agrobacterium for a period of approximately one hour. Following this inoculation step, the Agrobacterium was removed by pipetting and the explants were placed in co-culture for 2-4 days. At this point, they were transferred to selection media (WPM+0.075 mM glyphosate+antibiotics to control Agrobacterium overgrowth) for 5-7 weeks to allow selection and growth of transgenic shoots.

Phenotype positive shoots were harvested approximately 5-7 weeks post-inoculation and placed into selective rooting media (BRM+0.025 mM glyphosate) for 2-3 weeks. Shoots producing roots were transferred to the greenhouse and potted in soil. Shoots that remained healthy on selection but did not produce roots were transferred to non-selective rooting media (BRM without glyphosate) for an additional 2 weeks. The roots from any shoots that produced roots off the selection were tested for expression of the plant selectable marker glyphosate resistance before transferring to the greenhouse and potting in soil. Plants were maintained under standard greenhouse conditions until R1 seed harvest.

Analysis of Amino Acid Content of R1 Seed

Mature R1 seed is produced and analyzed for free amino acid content using fluorescence detection as described in Agilent Technologies Technical Bulletin REV14. Five seeds are chosen for single seed analysis from each event. Soy seeds expressing AgroAS F298W, AgroAS S51F, AgroAS V48F, AgroAS V48Y or AgroAS S51C mutant proteins generate very high amounts of tryptophan. The highest levels of tryptophan have a negative impact on germination. Results are shown in Tables L, M and N.

TABLE L
Protein expression in Seeds Transformed with pMON58046

| Pedigree | Trp average (ppm) | Protein present? |
|---|---|---|
| Control | 96 | no |
| 22817 | 9922 | yes |
| 22891 | 12955 | yes |
| 23026 | 7968 | yes |

TABLE M
AS Protein expression Correlated with pMON58123 Transformation

| Pedigree | Trp average (ppm) | Protein present? |
|---|---|---|
| Control | 96 | No |
| 23562 | 88 | No |
| 23590 | 8795 | Yes |
| 23911 | 388 | No |

TABLE N
Average and max trp levels in soybeans carrying one of the following AgroAS alleles: V48F (pMON66877), V48Y (pMON66878) and S51C (pMON66879).

| pMON number | Description | Event number | Average of trp (ppm) | Max of trp (ppm) |
|---|---|---|---|---|
| 66877 | 7Salpha-V48F AgroAS | 26640 | 12,283 | 28,342 |
| 66877 | 7Salpha-V48F AgroAS | 26641 | 5,588 | 14,579 |
| 66877 | 7Salpha-V48F AgroAS | 26642 | 11,833 | 18,712 |
| 66878 | 7Salpha-V48Y AgroAS | 26872 | 6,015 | 11,902 |
| 66878 | 7Salpha-V48Y AgroAS | 26875 | 12,361 | 17,181 |
| 66878 | 7Salpha-V48Y AgroAS | 27010 | 13,962 | 19,323 |
| 66879 | 7Salpha-S51C AgroAS | 27105 | 12,614 | 31,827 |
| 66879 | 7Salpha-S51C AgroAS | 27300 | 16,711 | 34,263 |
| 66879 | 7Salpha-S51C AgroAS | 27568 | 10,135 | 20,237 |

AS Enzyme Activity in R1 Seed Transformed with AgroAS

Mature R1 seed is produced and analyzed for anthranilate synthase activity. Anthranilate synthase enzymatic activity was determined in R1 soy seeds carrying the AgroAS F298W (SEQ ID NOs: 65 or 91) or the AgroAS S51F (SEQ ID NOs: 60 or 86) mutant alleles. Very high levels of tryptophan-resistant anthranilate synthase activity were observed, consistent with the high amounts of tryptophan generated by these seeds. Results are shown in Tables O and P.

TABLE O
Specific activity of AS in R1 Seeds Transformed with pMON58046

| Event | Seed number | Specific activity (pmoles/mg/min) | Specific activity (pmoles/mg/min) (+25 micromolar Trp) |
|---|---|---|---|
| Control | | 77.6 | |
| 23076 | 23076-1 | 100.5 | 1.04 |
| | 23076-2 | 4512.8 | |
| | 23076-3 | 9737.4 | 9290.4 |
| | 23076-4 | 136.12 | |
| | 23076-5 | 8992.5 | 9749.9 |

TABLE P
Specific activity of AS in R1 Seeds Transformed with pMON58123

| Event | Seed number | Specific activity (pmoles/mg/min) | Specific activity (pmoles/mg/min) (+25 micromolar Trp) |
|---|---|---|---|
| Control | | 83.7 | 32.7 |
| 23590 | 23590-1 | 891 | 692.3 |
| | 23590-2 | 466.2 | 186.5 |
| | 23590-3 | 71.7 | 38.3 |
| | 23590-4 | 320.5 | 316.2 |
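By way of illustration only, the degree of tryptophan resistance implied by Table P can be expressed as the percentage of specific activity retained in the presence of 25 micromolar Trp. The following minimal Python sketch uses the values of Table P; the names `TABLE_P` and `percent_retained` are hypothetical and are not part of the assay described above.

```python
# Specific activities (pmoles/mg/min) copied from Table P:
# seed -> (without added Trp, with 25 micromolar Trp)
TABLE_P = {
    "Control": (83.7, 32.7),
    "23590-1": (891.0, 692.3),
    "23590-2": (466.2, 186.5),
    "23590-3": (71.7, 38.3),
    "23590-4": (320.5, 316.2),
}


def percent_retained(without_trp, with_trp):
    """Percentage of AS specific activity remaining in the presence of Trp."""
    return 100.0 * with_trp / without_trp


# Illustrative summary: one retained-activity value per seed.
retained = {seed: percent_retained(a, b) for seed, (a, b) in TABLE_P.items()}

for seed, pct in retained.items():
    print(f"{seed}: {pct:.1f}% of activity retained with 25 micromolar Trp")
```

On these values the control retains roughly 39% of its activity, whereas seed 23590-4 retains roughly 99%.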

Example 7 Preparation of Transformation Vector Comprising Ruta graveolens Anthranilate Synthase α-Subunit

The anthranilate synthase α gene from Ruta graveolens (Genbank Accession No. GI 960291) provides another anthranilate synthase domain useful in the present invention (Bohlmann, 1996). One isoenzyme of anthranilate synthase present in the genome of Ruta graveolens demonstrates less susceptibility to feedback inhibition by L-tryptophan. This allele may also be useful in the present invention to elevate the levels of free L-tryptophan in transgenic plants. The vector pMON58030 contains the Ruta graveolens anthranilate synthase α-subunit that is less sensitive to tryptophan inhibition. The Ruta graveolens anthranilate synthase α gene was PCR amplified from pMON58030 to provide a BamHI site at the 5′ end and a BglII site at the 3′ end of the Ruta graveolens anthranilate synthase α gene fragment by utilizing PCR primers that contained these two restriction enzyme sites:

5′-CAAAAGCTGGATCCCCACC-3′; (SEQ ID NO: 53) and 5′-CCTATCCGAGATCTCTCAACTCC-3′. (SEQ ID NO: 54)

The PCR fragment was purified and digested with the respective restriction enzymes, and was used to form pMON58041, which contains the transcriptional fusion of the Ruta graveolens ASα to the napin promoter. The Agrobacterium-mediated plant transformation plasmid, pMON58043, was created comprising the napin promoter, Ruta graveolens AS, NOS terminator, glyphosate resistance (CP4) selectable marker and borders suitable for proper chromosomal integration of the cassette as described. The resulting plant transformation vector was used to transform plants using standard plant transformation techniques as described in Examples 2, 3, and 6.
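As an illustrative check only, the presence of the BamHI and BglII recognition sites in the two primers (SEQ ID NOs: 53 and 54) can be confirmed with a simple string search. The sketch below assumes the standard recognition sequences GGATCC (BamHI) and AGATCT (BglII); the function name `find_sites` is hypothetical.

```python
# Standard recognition sequences for the two enzymes named in the text.
SITES = {"BamHI": "GGATCC", "BglII": "AGATCT"}

# Primer sequences as given for SEQ ID NOs: 53 and 54 (written 5' to 3').
PRIMERS = {
    "SEQ ID NO: 53": "CAAAAGCTGGATCCCCACC",
    "SEQ ID NO: 54": "CCTATCCGAGATCTCTCAACTCC",
}


def find_sites(primer, sites=SITES):
    """Return {enzyme: 0-based start index} for each site found in the primer."""
    return {name: primer.find(seq) for name, seq in sites.items() if seq in primer}


for label, primer in PRIMERS.items():
    print(label, find_sites(primer))
```

Both recognition sequences are palindromic, so searching the primer strand as written is sufficient for this check.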

Example 8 Transforming Multi-Polypeptide Anthranilate Synthases into Monomeric Single Polypeptide Anthranilate Synthases

Generation of a monomeric anthranilate synthase by fusion of selected multi-subunit enzymes is desirable, for example, to maximize catalytic efficiency, to stabilize the enzyme, to achieve coordinated expression of subunits comprising the activities of TrpE and TrpG, and to provide effective communication between the two subunits. In some instances, it may be useful to employ TrpE or α-subunits from either plant or microbial sources that are deregulated with respect to feedback inhibition by standard mutagenesis techniques or by rational design as described in the foregoing Examples, e.g. in Example 4. In other instances, wild type TrpE or α-subunits from either plant or microbial sources are employed.

The C-terminus of the selected TrpE or α-subunit is linked to the N-terminus of the TrpG subunit or β-subunit, preferably with a peptide linker. A linker can be rationally designed to provide suitable spacing and flexibility for both subunits to properly align. Alternatively, a linker can be identified by sequence alignment of monomeric and heterotetrameric anthranilate synthases. Examples of sequence alignments of monomeric and heterotetrameric anthranilate synthase forms are shown in FIGS. 7 and 15. It is also envisioned that it may be necessary to generate monomeric anthranilate synthases comprising heterologous subunits in order to maximize the benefits. For example, an α-subunit may be obtained from a bacterial source, for example, E. coli, and fused to a β-subunit from a plant source, for example, Arabidopsis.

The novel protein produced can be introduced into plants, for example, as described in Examples 2, 3, or 6. The present invention is not limited to the exact details shown and described, for it should be understood that many variations and modifications may be made while remaining within the spirit and scope of the present invention defined by the claims.

Example 9 Identification of Anthranilate Synthases from Genomic Sequence Databases

Monomeric anthranilate synthases, as well as α and β domains useful in the invention, can be identified by bioinformatics analysis, for example by searching the GenBank and/or Swiss-Prot databases using BLAST (www.ncbi.nlm.nih.gov/blast/). Useful query sequences to identify monomeric anthranilate synthases include, for example, domains of anthranilate synthase such as the α-domain (GI 1004323) or β-domain (GI 1004324) from Sulfolobus solfataricus, or a monomeric anthranilate synthase such as Agrobacterium tumefaciens AS (GI 15889565). A putative monomeric anthranilate synthase will have between 50% and 100% homology with the query sequence and should contain at least 700 amino acids. If the AS α-domain is used to query the genomic database, then in addition to identifying putative anthranilate synthase genes it is also likely to identify genes involved in PABA synthesis, for example 4-amino-4-deoxychorismate (ADC) synthase. Monomeric ADC synthase genes can be readily distinguished from putative monomeric AS genes based on the observation that the amidotransferase domain (β-domain) of ADC synthase resides at the N-terminus of the protein, whereas the amidotransferase domain (β-domain) of AS resides at the C-terminus. Monomeric anthranilate synthases useful in the present invention identified by bioinformatics analysis include, but are not limited to, those of Rhizobium meliloti (GI 95177), Mesorhizobium loti (GI 13472468), Brucella melitensis (GI 17982357), Nostoc sp. PCC7120 (GI 17227910, GI 17230725), Azospirillum brasilense (GI 1174156), Rhodopseudomonas palustris, and Anabaena M22983 (GI 152445). FIG. 7 is an example of a sequence alignment of 2 monomeric anthranilate synthases (Agrobacterium tumefaciens and Rhizobium meliloti) with 2 heterotetrameric anthranilate synthases (Sulfolobus solfataricus and Arabidopsis thaliana) useful in the present invention. FIG. 15 is an example of a sequence alignment of several monomeric anthranilate synthases with the Rhodopseudomonas palustris heterotetrameric anthranilate synthase.
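The screening criteria set out above (50% to 100% homology, at least 700 amino acids, and a C-terminal amidotransferase domain for AS versus an N-terminal one for ADC synthase) may be sketched as a simple filter. This is an illustration under stated assumptions only: percent identity is used as a stand-in for "homology", the `Hit` record and its field names are hypothetical, and the example hits are invented for demonstration rather than taken from any database search.

```python
from dataclasses import dataclass


@dataclass
class Hit:
    name: str
    percent_identity: float    # identity to the query, a stand-in for "homology"
    length_aa: int             # length of the hit protein in amino acids
    beta_domain_terminus: str  # "N" or "C": location of the amidotransferase domain


def classify(hit, min_identity=50.0, min_length=700):
    """Apply the homology, length and domain-order criteria to one hit."""
    if not (min_identity <= hit.percent_identity <= 100.0):
        return "rejected"
    if hit.length_aa < min_length:
        return "rejected"
    if hit.beta_domain_terminus == "C":
        return "putative monomeric AS"
    if hit.beta_domain_terminus == "N":
        return "putative monomeric ADC synthase"
    return "rejected"


# Invented example hits, for demonstration only.
example_hits = [
    Hit("hypothetical_hit_1", 62.0, 729, "C"),
    Hit("hypothetical_hit_2", 55.0, 710, "N"),
    Hit("hypothetical_hit_3", 71.0, 430, "C"),
]

for hit in example_hits:
    print(hit.name, "->", classify(hit))
```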

Example 10 Optimized Codon Usage

This example sets forth a method of providing an increased level of free tryptophan in the seed of a plant by improving the expression of an anthranilate synthase gene by optimization of the codon usage.

The AgroAS S51F mutant allele described in Example 4 was used to generate a codon-optimized AgroAS S51C mutant allele according to the following procedure:

1. The amino acid sequence of the AgroAS S51F allele (SEQ ID NO: 60) was scanned using a monocot high codon usage table (Table Q) along with some of the additional criteria described below to generate an optimized DNA sequence.
2. Codons were optimized by selecting codons 1-50 from the most abundant codons from the monocot high usage table (Table Q) and selecting codons 51-729 randomly from the table.
3. Undesirable sequences such as insert stops in the wrong frames were not used, even if it meant changing the codon usage.
4. Codon repeats were not used.
5. Rare codons in the first 50 positions were not used; and codons were randomized in other positions.
6. A+T-rich regions or those which may contain cryptic mRNA processing sites were avoided. Table R lists the sites which were avoided.
7. The output sequence was manually edited according to the monocot high codon usage table (Table Q) to increase the GC content from 56% to 59.86%, which is similar to the GC content of the native AgroAS wild type alleles (60.3%).
8. The resultant sequence was again scanned according to the method described in Flasinski, S., WO 2004/009761A2, using the SeqLab FINDPATTERNS program to eliminate any unwanted restriction sites or cryptic polyadenylation signals present in the coding region (Table R). Regions which were identified by this process were manually edited to eliminate such sites.
9. The final optimized DNA sequence (SEQ ID NO: 144; FIG. 22A-B) was assembled and analyzed for integrity by translation to the corresponding amino acid sequence (FIG. 26) and by comparing the nucleotide and amino acid sequences with the unmodified AgroAS S51F mutant allele (FIGS. 23A-C and 24A-B). The nucleotide and amino acid sequences have 82.9% and 100% identity to the nucleotide and amino acid sequences of the non-optimized AgroAS S51F mutant allele.
10. The codon-optimized AgroAS S51F DNA sequence was modified at codon 51 from TTC to TGC (nucleotides 151-153) to generate the AgroAS S51C codon-optimized mutant allele as described in U.S. patent application Ser. No. 10/430,011.
11. The DNA sequence of the codon-optimized AgroAS S51C allele (FIG. 25A-B; SEQ ID NO: 145) was synthesized (BlueHeron Biotechnology, Bothell, Wash.).
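The scanning and bookkeeping steps of the above procedure (pattern scanning, GC content, and the position of codon 51) can be sketched in Python as follows. This is a minimal illustration and not the SeqLab FINDPATTERNS program: the handful of patterns is copied from Table R, `N` is assumed to match any nucleotide, only the strand as written is searched, and the function names and the short test sequence are hypothetical.

```python
import re

# A few patterns copied from Table R; N is assumed to stand for any nucleotide.
PATTERNS = ["AATAAA", "GGATCC", "CCANNNNNTGG", "GACNNNGTC", "TTTTT"]


def scan(seq, patterns=PATTERNS):
    """Return (pattern, 1-based start) for every match, including overlaps."""
    hits = []
    for pattern in patterns:
        # A lookahead lets overlapping occurrences be reported individually.
        regex = re.compile("(?=(" + pattern.replace("N", "[ACGT]") + "))")
        for match in regex.finditer(seq):
            hits.append((pattern, match.start() + 1))
    return hits


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def codon_nt_range(codon_number):
    """1-based nucleotide range covered by a 1-based codon number."""
    start = 3 * (codon_number - 1) + 1
    return start, start + 2


# Invented 24-nucleotide sequence, for demonstration only.
demo = "ATGGCCAATAAAGGATCCTTTTTT"
print(scan(demo))
print(round(gc_percent(demo), 2))
print(codon_nt_range(51))
```

For codon 51 the helper returns nucleotides 151-153, which agrees with the position given in step 10.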

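TABLE Q
Monocot high codon usage table. Codon Usage Frequencies of Monocot_High

| AA | Codon | Freq. | AA | Codon | Freq. | AA | Codon | Freq. | AA | Codon | Freq. |
|---|---|---|---|---|---|---|---|---|---|---|---|
| G | GGG | 0.19 | A | GCC | 0.36 | T | ACT | 0.25 | S | TCA | 0.11 |
| G | GGA | 0.16 | R | AGG | 0.27 | T | ACC | 0.42 | S | TCT | 0.18 |
| G | GGT | 0.23 | R | AGA | 0.11 | W | TGG | 1.00 | S | TCC | 0.28 |
| G | GGC | 0.42 | S | AGT | 0.09 | * | TGA | 0.54 | R | CGG | 0.14 |
| E | GAG | 0.77 | S | AGC | 0.22 | C | TGT | 0.23 | R | CGA | 0.05 |
| E | GAA | 0.23 | K | AAG | 0.84 | C | TGC | 0.77 | R | CGT | 0.14 |
| D | GAT | 0.41 | K | AAA | 0.16 | * | TAG | 0.30 | R | CGC | 0.30 |
| D | GAC | 0.59 | N | AAT | 0.30 | * | TAA | 0.16 | Q | CAG | 0.78 |
| V | GTG | 0.36 | N | AAC | 0.70 | Y | TAT | 0.28 | Q | CAA | 0.22 |
| V | GTA | 0.05 | M | ATG | 1.00 | Y | TAC | 0.72 | H | CAT | 0.32 |
| V | GTT | 0.24 | I | ATA | 0.10 | L | TTG | 0.10 | H | CAC | 0.68 |
| V | GTC | 0.34 | I | ATT | 0.29 | L | TTA | 0.02 | L | CTG | 0.30 |
| A | GCG | 0.20 | I | ATC | 0.61 | F | TTT | 0.28 | L | CTA | 0.05 |
| A | GCA | 0.15 | T | ACG | 0.18 | F | TTC | 0.72 | L | CTT | 0.20 |
| A | GCT | 0.28 | T | ACA | 0.15 | S | TCG | 0.12 | L | CTC | 0.33 |
| P | CCG | 0.24 | P | CCA | 0.22 | P | CCT | 0.26 | P | CCC | 0.28 |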

The numbers in Table Q are the probabilities of having a particular codon used to code a particular amino acid, which are based on observed frequencies of each codon for any particular amino acid. This means that if there are four codons that encode a given amino acid, and if all of the codons are represented equally in that sequence, then each codon will get a value of 0.25. The higher the value, the higher is the frequency with which a particular amino acid is encoded by a corresponding codon.
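To illustrate how such a table may be used, the sketch below back-translates a short peptide using an excerpt of Table Q: the most abundant codon is taken for the first `n_top` positions and a codon is drawn at random for the remaining positions. Weighting the random draw by the tabulated frequency is an assumption made for this sketch; the procedure described herein states only that the codons were selected randomly from the table. The function name `back_translate` and the test peptide are hypothetical, and the additional criteria of the procedure (avoidance of codon repeats, unwanted sites, and the like) are not implemented.

```python
import random

# Excerpt of Table Q: amino acid -> {codon: frequency}.
MONOCOT_HIGH = {
    "M": {"ATG": 1.00},
    "S": {"TCA": 0.11, "TCT": 0.18, "TCC": 0.28,
          "AGT": 0.09, "AGC": 0.22, "TCG": 0.12},
    "F": {"TTT": 0.28, "TTC": 0.72},
    "C": {"TGT": 0.23, "TGC": 0.77},
    "K": {"AAG": 0.84, "AAA": 0.16},
}


def back_translate(protein, table=MONOCOT_HIGH, n_top=50, rng=None):
    """Choose one codon per residue.

    Positions before n_top receive the most abundant codon; later positions
    receive a codon drawn at random, weighted by frequency (an assumption).
    """
    rng = rng or random.Random()
    codons = []
    for index, residue in enumerate(protein):
        choices = table[residue]
        if index < n_top:
            codon = max(choices, key=choices.get)
        else:
            codon = rng.choices(list(choices), weights=list(choices.values()))[0]
        codons.append(codon)
    return "".join(codons)


# Invented five-residue peptide, for demonstration only.
print(back_translate("MSFCK", n_top=5))
```

With all five positions treated as "top" positions, the peptise MSFCK is rendered with the most abundant codon at each residue (ATG, TCC, TTC, TGC, AAG).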

The use of codon high usage tables in the process of codon optimization is well known in the art. Due to the rate at which genomic information is becoming available on a daily basis, construction of such tables is a dynamic process. As such, the information contained in such tables regarding the frequency of usage of each codon will change with the currently prevailing knowledge for each particular species. With this in mind, the present invention contemplates the dynamic alteration in rates of usage of each codon in the monocot high codon usage table (Table Q) which was used to design the artificial polynucleotide molecules encompassed herein.

TABLE R
List of unwanted restriction sites/mRNA splicing sites to be avoided when possible in the AgroAS (S51F) allele during the final scanning process using the SeqLab FINDPATTERNS program. An example of a pattern data file for the program FINDPATTERNS in SeqLab.

| Name | Offset | Pattern | Name | Offset | Pattern | Name | Offset | Pattern |
|---|---|---|---|---|---|---|---|---|
| 1 | 1 | AATAAA | 2 | 1 | AACCAA | 3 | 1 | ATATAA |
| 4 | 1 | AATCAA | 5 | 1 | ATACTA | 6 | 1 | ATAAAA |
| 7 | 1 | ATGAAA | 8 | 1 | AAGCAT | 9 | 1 | ATACAT |
| 10 | 1 | AAAATA | 11 | 1 | ATTAAA | 12 | 1 | AATTAA |
| 13 | 1 | AATACA | 14 | 1 | CATAAA | 15 | 1 | ATTAAT |
| 16 | 1 | ATTTA | 17 | 1 | GGATCC | 18 | 1 | TGATCA |
| 19 | 1 | AGATCT | 20 | 1 | CATTT | 21 | 1 | CATTTTGTAT |
| 22 | 1 | ATCGAT | 23 | 1 | ATAGATT | 24 | 1 | CTGAAG |
| 25 | 1 | GAATTC | 26 | 1 | GATATC | 27 | 1 | GGCGCC |
| 28 | 1 | TGCGCA | 29 | 1 | AAGCTT | 30 | 1 | GTTAAC |
| 31 | 1 | GGTACC | 31 | 1 | ACGCGT | 32 | 1 | CCATGG |
| 33 | 1 | CATATG | 34 | 1 | GCCGGC | 35 | 1 | GCTAGC |
| 36 | 1 | GCGGCCGC | 37 | 1 | TCGCGA | 38 | 1 | ATGCAT |
| 39 | 1 | TTAATTAA | 40 | 1 | TGTAT | 41 | 1 | ACATGT |
| 42 | 1 | CCANNNNNTGG | 43 | 1 | AATAAT | 44 | 1 | AACCAA |
| 45 | 1 | CTGCAG | 46 | 1 | CGATCG | 47 | 1 | CAGCTG |
| 48 | 1 | GAGCTC | 49 | 1 | CCGCGG | 50 | 1 | GTCGAC |
| 51 | 1 | AGTACT | 52 | 1 | CCCGGG | 53 | 1 | TACGTA |
| 54 | 1 | ACTAGT | 55 | 1 | GCATGC | 56 | 1 | AGGTAA |
| 57 | 1 | AGGTA | 58 | 1 | GGTAA | 59 | 1 | GCAGGT |
| 60 | 1 | GCAGG | 61 | 1 | CAGGT | 62 | 1 | AATATT |
| 63 | 1 | AGGCCT | 64 | 1 | TGGGTA | 65 | 1 | GACNNNGTC |
| 66 | 1 | ATTAAT | 67 | 1 | TCTAGA | 68 | 1 | CTCGAG |
| 69 | 1 | TTTTT | 70 | 1 | CCCC | 71 | 1 | GGGG |
| 72 | 1 | AAAAA | 73 | 1 | CCCCC | 74 | 1 | GGGGG |

Construction of Agro AS-S51C (pMON69770 & pMON68065) and Agro AS-S51C-nno (Non-Native Optimized Gene; pMON68066)

A base vector, pMON66879 (FIG. 20), was constructed comprising two transcription units between the right and left borders from Agrobacterium tumefaciens. One transcription unit for a marker comprised:

(a) DNA of FMV.35S promoter (promoter for the 35S RNA from Figwort mosaic virus)
(b) DNA of L-Ph.Hsp70 (5′-untranslated region from Petunia hybrida heat shock protein 70 gene)
(c) DNA of At.ShkF-CTP2 (targeting peptide of Arabidopsis EPSP (5-enolpyruvylshikimate-3-phosphate) synthase)
(d) DNA of AGRtu.aroA-CP4 (coding region for non-naturally occurring aroA-CP4)

The other transcription unit for the Agrobacterium tumefaciens anthranilate synthase S51C (AgroAS-S51C) mutant allele comprised:

(a) DNA of 7S promoter (7S alpha′ promoter from soybean)
(b) DNA of ATSSU1AMAT (Arabidopsis thaliana Ribulose bisphosphate carboxylase oxygenase (RUBISCO) small subunit chloroplast targeting peptide plus parts of the N-terminal region of mature protein)
(c) DNA of AgroAS (S51C) mutant allele; and
(d) DNA of Agrobacterium tumefaciens nopaline synthase (nos) terminator.

SEQ ID NO: 149 is the polynucleotide sequence of a transformation vector comprising the above-described marker and mutant allele transcription elements. See Table S below for a description of the elements of the transformation vector contained within SEQ ID NO: 149.

TABLE S
Elements in a Transformation Vector

| Bases of SEQ ID NO: 149 | Description of DNA segment |
|---|---|
| 1-474 | A. tumefaciens left border |
| 516-1067 | DNA of FMV.35S promoter |
| 1071-1166 | DNA of L-Ph.Hsp70 |
| 1176-1403 | DNA of At.ShkF-CTP2 |
| 1404-2771 | DNA of AGRtu.aroA-CP4 |
| 3524-4363 | DNA of 7S promoter |
| 4389-4652 | DNA of ATSSU1AMAT |
| 4653-6846 | DNA of Agrobacterium tumefaciens anthranilate synthase S51C mutant allele |
| 7233-7561 | A. tumefaciens right border |

The DNA of a vector with the elements listed in Table S, pMON66879 (FIG. 20), was digested with NcoI and StuI restriction enzymes, separated on 1 percent agarose gel, and the 1499 bp DNA fragment that represents part of the Agrobacterium anthranilate synthase S51C mutant allele (from nucleotide 1 to nucleotide 1498 of SEQ ID NO: 87) was cut out from the gel. The DNA fragment was purified using a QIAGEN gel extraction kit (QIAGEN Inc.). pMON69754 was also cut with NcoI and StuI restriction enzymes and separated on 1 percent agarose gel. The 6286 bp fragment was cut out from the gel and the DNA fragment was purified using a QIAGEN gel extraction kit (QIAGEN Inc.). Ligation reactions were carried out by mixing the 6286 bp DNA fragment corresponding to pMON69754 DNA, and the 1499 bp DNA fragment corresponding to pMON66879 (FIG. 20), to generate pMON69769. The DNA of pMON69769 was digested with XhoI restriction enzyme, and the isolated 4893 bp fragment containing the Zea mays (Zm) oleosin promoter fused to the Zmhsp70 (heat shock protein 70; Zm.DNAK) intron, the AgroAS-S51C allele and the Tr7-3′UTR was cloned into the XhoI site of pMON78808 to make the final transformation vector pMON69770 (FIG. 27).

Construction of pMON68065

The DNA of pMON69769 was subjected to partial digestion by XhoI and BamHI restriction enzymes, separated on 1 percent agarose gel, and the 7238 bp DNA fragment comprising all of the genetic elements of pMON69769, except the Tr7-3′-untranslated region (3′-UTR) was isolated according to the procedures recited above.

The rice glutelin 1 (Os-gt1) 3′-UTR was amplified from pMON47901 DNA by PCR using the following primers:

5′ primer (ACGCGGATCCGTTGGCAATGCGGATAAAGAATAAC; SEQ ID NO: 150)
3′ primer (TACGCTCGAGCCATAAGATAAGGGAGGGTTGTCC; SEQ ID NO: 151)

The amplified DNA fragment was cut with XhoI and BamHI restriction enzymes, and was ligated with the 7238 bp fragment of pMON69769, to generate pMON68063.

pMON68063 was then digested with XhoI restriction enzyme, separated on 1 percent agarose gel, and the 4569 bp DNA fragment that corresponds to the Zea mays Oleosin promoter fused to the Zmhsp70 (Zm.DNAK) intron, the AgroAS-S51C allele and the Os-gt1-3′UTR was isolated from the gel.

pMON78808 DNA was also cut with XhoI restriction enzyme, dephosphorylated using alkaline phosphatase enzyme (New England BioLabs Inc.), and the DNA was purified using a QIAGEN PCR purification kit (QIAGEN Inc.). The XhoI-digested DNA of pMON78808 and the 4569 bp XhoI-digested DNA fragment of pMON68063 were ligated together to generate pMON68065 (FIG. 28).

Construction of pMON68066

Codon optimization of the AgroAS-S51C mutant allele was carried out as described above. Codon-optimized AgroAS-S51C-nno DNA was synthesized (Blue Heron Biotechnology Group, Bothell, Wash., USA). The synthetic DNA was then digested with NcoI and BamHI restriction enzymes, separated on 1 percent agarose gel, and the 2193 bp DNA fragment corresponding to AgroAS-S51C-nno was cut out from the gel and purified as described above. The DNA of pMON68063 was also cut with NcoI and BamHI restriction enzymes and separated on 1 percent agarose gel. The isolated 5244 bp NcoI/BamHI-digested DNA fragment contains all the genetic elements described in pMON68063 except the AgroAS-S51C mutant allele. Ligation reactions were carried out by mixing the NcoI/BamHI-digested AgroAS-S51C-nno DNA (2193 bp fragment) and pMON68063 (5244 bp DNA fragment), to generate pMON68064. pMON68064 was then cut with XhoI restriction enzyme, separated on 1 percent agarose gel, and the 4532 bp DNA fragment that contains the Zea mays oleosin promoter fused to the hsp70 (Zm.DNAK) intron, the AgroAS-S51C-nno and the Os-gt1-3′UTR was purified as described above. The DNA of pMON78808 was also cut with XhoI restriction enzyme, dephosphorylated using alkaline phosphatase enzyme (New England BioLabs Inc.), and purified using a QIAGEN PCR purification kit (QIAGEN Inc.). Ligation reactions were carried out by mixing the dephosphorylated XhoI-cut pMON78808 DNA and the 4532 bp DNA fragment of pMON68064, to generate pMON68066 (FIG. 29).

Transformation of Maize

pMON69770, pMON68065, and pMON68066 were transformed into an elite corn inbred (LH244) (Corn States Hybrid Serv., LLC, Des Moines, Iowa) essentially as described in U.S. Patent Application Publication 20050005327, which is herein incorporated in its entirety by reference. Briefly, ears containing immature embryos are harvested approximately 10 days after pollination and kept refrigerated at 4° C. until use (up to 5 days post-harvest). The preferred embryo size for this method of transformation is ˜1.5-2.0 mm. This size is usually achieved 10 days after pollination inside the greenhouse under growth conditions of an average temperature of 87° F. and a day length of 14 hours, with supplemental lighting supplied by GE 1000 Watt High Pressure Sodium lamps.

Immature embryos are isolated from surface sterilized ears and directly dropped into the prepared Agrobacterium cell suspension in a 1.5-mL microcentrifuge tube. The isolation lasts continuously for approximately 5 to 60 minutes. Alternately, embryos are excised directly into inoculation medium (without Agrobacterium or acetosyringone) for 5-60 minutes and subsequently inoculated for 5-30 minutes with Agrobacterium cell suspension. After the Agrobacterium cell suspension is removed using a fine tipped sterile transfer pipette, the immature embryos are transferred onto a crn398 co-culture medium (Table T). The embryos are then placed on the medium with the scutellum side facing up. The embryos are cultured in a dark incubator (23° C.) for approximately 14-48 hours.

The embryos are then transferred onto a callus induction medium (crn336, Table T) which contains 0.1 mM glyphosate and 500 mg/L carbenicillin to inhibit Agrobacterium in Petri dishes (100 mm×25 mm). The cultures are incubated in a dark culture room/incubator at 30° C. for 2 weeks followed by an additional week in a dark culture room/incubator at 27° C. All the callus pieces are then transferred individually onto the first regeneration medium (crn335, Table T) which contains 0.1 mM glyphosate and 250 mg/L carbenicillin. The cultures are grown on this medium in a 27° C. culture room with 16 hours light/8 hours dark photoperiod for 7 to 10 days. They are then transferred onto the second regeneration medium (crn333, Table T) in Petri dishes (100 mm×25 mm) at 27° C. with 16 hours of light for approximately 2 to 3 weeks. All the callus pieces with regenerating shoots and living tissue are then transferred onto either fresh crn333 plates or crn334 phytatrays (Table T) or directly transferred to sundae cups containing a rooting medium (crn366, Table T) to grow further prior to being transferred to soil (approximately 1 to 3 weeks). The regeneration media (crn335, crn333 and crn334) all contain 250 mg/L carbenicillin and 0.1 mM glyphosate.
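The final concentrations stated above follow from the stock concentrations and per-liter volumes listed in Table T. As a worked illustration only, the arithmetic may be sketched as follows; the function names are hypothetical, and a final volume of one liter is assumed, as in Table T.

```python
def final_mM(stock_molar, volume_mL, final_volume_L=1.0):
    """Final concentration in mM: (mol/L) x mL gives mmol, divided by liters."""
    return stock_molar * volume_mL / final_volume_L


def final_mg_per_L(stock_mg_per_mL, volume_mL, final_volume_L=1.0):
    """Final concentration in mg/L from a stock given in mg/mL."""
    return stock_mg_per_mL * volume_mL / final_volume_L


# Glyphosate: 0.2 mL of 0.5 M stock per liter.
print(final_mM(0.5, 0.2))
# Carbenicillin: 2.0 mL or 1.0 mL of 250 mg/mL stock per liter.
print(final_mg_per_L(250, 2.0))
print(final_mg_per_L(250, 1.0))
```

These give 0.1 mM glyphosate and 500 mg/L or 250 mg/L carbenicillin, consistent with the concentrations recited for the callus induction and regeneration media.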

Plantlets are then transferred to soil, hardened off in a growth chamber at 27° C., 80% humidity, and low light intensity for approximately 1 to 2 weeks, and then transferred to a greenhouse and grown under standard greenhouse conditions. The resulting kernels are collected and analyzed as described below.

TABLE T
Composition of media used in corn transformation (per Liter).

Media columns, in order: Co-culture media (crn398); Callus induction media (crn336); First regeneration media (crn335); Second regeneration media (crn333 (plates)/crn334 (phytatrays)); Rooting media (crn366)

MS salts: 2.17 g; 4.33 g; 4.33 g; 4.33 g; 2.17 g
Sucrose: 20 g; 30 g; 30 g; 20 g
Maltose: 20 g
Glucose: 10 g; 10 g
L-Proline: 115 mg; 1.38 g; 1.38 g
Casamino Acids: 0.5 g; 0.05 g
IBA (1 mg/mL stock): 0.75 mL
L-Asparagine: 0.15 g
Myo-inositol: 0.1 g
NAA (1 mg/mL stock): 0.5 mL
Thiamine-HCl (0.5 mg/mL stock): 1.0 mL; 1.0 mL
2,4-D (1 mg/mL stock): 3.0 mL; 0.5 mL
Silver Nitrate (2 mg/mL stock): 1.7 mL; 1.7 mL
MS Vitamins 100×: 10 mL; 10 mL; 5.0 mL
MS Fromm 1000×: 1.0 mL; 1.0 mL
Carbenicillin (250 mg/mL): 2.0 mL; 1.0 mL; 1.0 mL
Glyphosate (0.5 M stock): 0.2 mL; 0.2 mL; 0.2 mL; 0.2 mL
Phytagel: 3.0 g; 3.0 g; 3.0 g; 3.0 g
BAP (0.5 mg/mL stock): 0.02 mL; 7.0 mL
Acetosyringone (1.0 M): 0.2 mL
Agarose Low EEO: 5.5 g

Analysis of Amino Acid Content of R1 Seed

Maize events transformed with the codon-optimized AgroAS constructs listed above as well as non-transgenic control events were analyzed to quantify free tryptophan levels in the kernels. Twelve mature kernels from each event were ground separately into powder and analyzed for free tryptophan (Trp) content.

To extract the amino acid fraction, 30 mg of crushed kernels are placed in a centrifuge vial. One milliliter of 5% trichloroacetic acid is added to each sample. The samples are mixed by vortex, and placed, with agitation, at room temperature for 15 minutes. The samples are then spun in a microcentrifuge for 15 minutes at 14,000 rpm. Some of the supernatant is then removed, placed in an HPLC vial and sealed. Samples are kept at 4° C. prior to analysis.

The amino acid analysis is performed as described in the Agilent Technical Publication, 2000. The separation is done using an Agilent 1100 series HPLC system (Agilent, Palo Alto, Calif.) with an Eclipse XDB-C18 5 μm, 4.6×150 mm column, and a flow rate of 1.2 ml/minute. Amino acid concentrations are measured using fluorescence: excitation at 340 nm, emission at 450 nm. Elution is accomplished with a gradient of HPLC Buffers A and B according to Table U, where HPLC Buffer A is 40 mM Na₂HPO₄, pH 7.8 and HPLC Buffer B is 9:9:2::Methanol:Acetonitrile:Water.

TABLE U
Amino Acid Elution

| Time | 0 | 20 | 21 | 26 | 27 |
|---|---|---|---|---|---|
| % Buffer A | 95 | 35 | 0 | 0 | 0 |
| % Buffer B | 5 | 65 | 100 | 100 | 100 |
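For illustration, the buffer composition at an intermediate time point may be estimated from the breakpoints of Table U. The sketch below assumes that the gradient is linear between the tabulated time points and that the times are expressed in minutes; neither assumption is stated in the table, and the function names are hypothetical.

```python
# Breakpoints copied from Table U (time units assumed to be minutes).
TIMES = [0, 20, 21, 26, 27]
PERCENT_B = [5, 65, 100, 100, 100]


def percent_b(t):
    """Percent Buffer B at time t, assuming linear ramps between breakpoints."""
    if t <= TIMES[0]:
        return float(PERCENT_B[0])
    if t >= TIMES[-1]:
        return float(PERCENT_B[-1])
    for i in range(len(TIMES) - 1):
        t0, t1 = TIMES[i], TIMES[i + 1]
        if t0 <= t <= t1:
            b0, b1 = PERCENT_B[i], PERCENT_B[i + 1]
            return b0 + (b1 - b0) * (t - t0) / (t1 - t0)


def percent_a(t):
    """Percent Buffer A is the remainder of the mobile phase."""
    return 100.0 - percent_b(t)


for t in (0, 10, 20, 20.5, 27):
    print(t, percent_a(t), percent_b(t))
```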

Amino acid standards are prepared or purchased in concentrations ranging from 0 to 100 μg/ml. Proline analysis requires an additional derivatization step using 9-fluorenylmethyl-chloroformate (FMOC). Results, as shown in Table V, are reported in ppm.

Kernels that had free tryptophan at or below 25 ppm were grouped as negative kernels and were used to calculate the average free tryptophan in negative kernels in each event (Table V) or in the construct (Table W and FIG. 30). Kernels that had free tryptophan >25 ppm were grouped as positive kernels and the data was used to calculate the average free tryptophan in positive kernels in each event (Table V) or in the construct (Table W and FIG. 30). Mature kernels from non-transgenic events (LH244) were also analyzed for free tryptophan content. The data is presented as the average of the free tryptophan in negative kernels.
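The grouping rule described above may be sketched as follows. The 25 ppm threshold is taken from the text; the function names and the twelve kernel values are hypothetical and are given for demonstration only, not as measured data.

```python
from statistics import mean

THRESHOLD_PPM = 25.0  # kernels at or below this value are grouped as negative


def split_kernels(trp_ppm):
    """Separate per-kernel free Trp values into positive and negative groups."""
    positive = [value for value in trp_ppm if value > THRESHOLD_PPM]
    negative = [value for value in trp_ppm if value <= THRESHOLD_PPM]
    return positive, negative


def event_summary(trp_ppm):
    """Average free Trp of the positive and of the negative kernels of one event."""
    positive, negative = split_kernels(trp_ppm)
    return {
        "positive_mean": mean(positive) if positive else None,
        "negative_mean": mean(negative) if negative else None,
        "n_positive": len(positive),
        "n_negative": len(negative),
    }


# Invented values for twelve kernels of one segregating event.
kernels = [480.0, 9.0, 510.0, 455.0, 8.5, 470.0, 25.0, 495.0, 10.0, 505.0, 460.0, 9.5]
print(event_summary(kernels))
```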

In general, transgenic events expressing the non-optimized AgroAS-S51C allele (pMON69770 and pMON68065) demonstrated an average free tryptophan level of 437 ppm. This is significantly higher than the free tryptophan levels seen in non-transgenic LH244 kernels, for which the average is 8.8 ppm. In contrast, expression of the codon-optimized AgroAS-S51C allele (pMON68066; AgroAS_S51C_nno) results in an enhanced average free tryptophan content of 768 ppm, with one event as high as 1435 ppm. Enhanced levels of free tryptophan due to expression of the codon-optimized AgroAS-S51C allele are clearly demonstrated in many transgenic events derived from pMON68066 (Tables V and W; FIG. 30).

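TABLE V

| Construct | Event | Average of Free Trp in positive kernels (ppm) | Standard deviation | Average of Free Trp in negative kernels (ppm) | Standard deviation |
|---|---|---|---|---|---|
| LH244 (non-transgenic) | LH244 | | | 8.8 | 1.4 |
| pMON69770 | ZM_S102609 | 481.3 | 5.5 | 9.9 | 1.7 |
| pMON69770 | ZM_S102614 | 441.7 | 135.5 | 9.0 | 1.0 |
| pMON69770 | ZM_S102630 | 593.7 | 42.6 | 12.2 | 4.0 |
| pMON69770 | ZM_S102637 | 343.8 | 27.1 | 9.3 | 1.0 |
| pMON69770 | ZM_S103183 | 412.4 | 40.4 | 8.6 | 1.0 |
| pMON69770 | ZM_S103189 | 433.0 | 61.2 | 9.3 | 0.5 |
| pMON69770 | ZM_S103202 | 417.2 | 48.1 | 10.0 | 1.4 |
| pMON69770 | ZM_S103233 | 353.5 | 102.0 | 11.8 | 1.3 |
| pMON69770 | ZM_S103234 | 515.2 | 61.9 | 10.3 | 2.7 |
| pMON69770 | ZM_S103245 | 537.8 | 38.8 | 10.9 | 1.4 |
| pMON69770 | ZM_S103257 | 386.4 | 24.8 | 8.1 | 0.9 |
| pMON69770 | ZM_S103261 | 413.9 | 46.4 | 10.3 | 1.0 |
| pMON69770 | ZM_S103276 | 371.0 | 61.9 | 8.8 | 0.5 |
| pMON69770 | ZM_S103277 | 417.8 | 40.4 | 7.9 | 1.1 |
| pMON68065 | ZM_S134462 | 550.8 | 102.3 | 11.7 | 1.7 |
| pMON68065 | ZM_S137274 | 218.2 | 32.5 | 9.2 | 1.6 |
| pMON68065 | ZM_S138234 | 197.1 | 2.1 | 8.6 | 0.8 |
| pMON68065 | ZM_S139813 | 584.8 | 92.1 | 9.8 | 0.5 |
| pMON68065 | ZM_S139815 | 571.5 | 37.2 | 7.4 | 0.7 |
| pMON68065 | ZM_S139816 | 535.3 | 58.8 | 10.0 | 1.4 |
| pMON68065 | ZM_S139817 | 416.5 | 58.3 | 11.0 | 0.7 |
| pMON68065 | ZM_S142728 | 525.3 | 97.3 | 10.3 | 2.5 |
| pMON68065 | ZM_S142738 | 404.7 | 25.5 | 9.2 | 0.7 |
| pMON68065 | ZM_S142742 | 384.6 | 39.9 | 7.5 | 0.8 |
| pMON68065 | ZM_S142747 | 423.9 | 39.7 | 8.3 | 0.6 |
| pMON68066 | ZM_S133073 | 824.0 | 71.3 | 8.0 | 1.1 |
| pMON68066 | ZM_S133076 | 672.4 | 52.0 | 8.3 | 1.0 |
| pMON68066 | ZM_S133078 | 524.7 | 66.7 | 8.4 | 0.5 |
| pMON68066 | ZM_S133080 | 670.6 | 40.8 | 9.6 | 1.1 |
| pMON68066 | ZM_S133082 | 588.0 | 38.0 | 8.6 | 1.3 |
| pMON68066 | ZM_S133083 | 731.0 | 81.6 | 7.6 | 0.8 |
| pMON68066 | ZM_S133089 | 800.3 | 87.9 | 10.4 | 1.3 |
| pMON68066 | ZM_S133092 | 603.5 | 89.0 | 10.3 | 2.1 |
| pMON68066 | ZM_S133093 | 1011.3 | 263.4 | 9.3 | 4.9 |
| pMON68066 | ZM_S133097 | 592.3 | 225.3 | 12.0 | 1.7 |
| pMON68066 | ZM_S133098 | 727.5 | 81.1 | 8.0 | 1.3 |
| pMON68066 | ZM_S133099 | 759.3 | 74.7 | 8.5 | 0.7 |
| pMON68066 | ZM_S133421 | 1435.2 | 43.7 | 13.1 | 2.5 |
| pMON68066 | ZM_S133431 | 692.6 | 41.8 | 8.3 | 0.8 |
| pMON68066 | ZM_S133433 | 697.0 | 48.1 | 9.2 | 1.9 |
| pMON68066 | ZM_S133434 | 966.3 | 126.9 | 9.8 | 4.2 |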

TABLE W

| Construct | Average of Free Trp in positive kernels (ppm; average of all the events) | Standard deviation | Average of Free Trp in negative kernels (ppm; average of all the events) | Standard deviation |
|---|---|---|---|---|
| LH244 (non-transgenic) | | | 8.8 | 1.4 |
| pMON69770 | 437.0 | 52.6 | 9.7 | 1.4 |
| pMON68065 | 437.5 | 53.2 | 9.4 | 1.1 |
| pMON68066 | 768.5 | 89.5 | 9.3 | 1.7 |
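The construct-level averages for positive kernels can be checked against the per-event averages of Table V. The following sketch, offered for illustration only, averages the event values for each construct and forms the ratio of the codon-optimized construct to pMON69770; the variable and function names are hypothetical.

```python
from statistics import mean

# Average free Trp in positive kernels (ppm) per event, copied from Table V.
EVENT_AVERAGES = {
    "pMON69770": [481.3, 441.7, 593.7, 343.8, 412.4, 433.0, 417.2,
                  353.5, 515.2, 537.8, 386.4, 413.9, 371.0, 417.8],
    "pMON68065": [550.8, 218.2, 197.1, 584.8, 571.5, 535.3, 416.5,
                  525.3, 404.7, 384.6, 423.9],
    "pMON68066": [824.0, 672.4, 524.7, 670.6, 588.0, 731.0, 800.3, 603.5,
                  1011.3, 592.3, 727.5, 759.3, 1435.2, 692.6, 697.0, 966.3],
}


def construct_average(construct):
    """Mean of the per-event averages for one construct."""
    return mean(EVENT_AVERAGES[construct])


for construct in EVENT_AVERAGES:
    print(construct, round(construct_average(construct), 1))

# Ratio of the codon-optimized construct to the non-optimized pMON69770.
fold = construct_average("pMON68066") / construct_average("pMON69770")
print(round(fold, 2))
```

The computed means agree with Table W to within rounding, and the ratio indicates that the codon-optimized construct averaged roughly 1.8 times the free tryptophan of pMON69770 in positive kernels.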

Example 11 Preparation of a Transformation Vector Comprising Monomeric Rhizobium meliloti Anthranilate Synthase

Cloning of Rhizobium meliloti AS gene

A stab culture of Rhizobium meliloti 1021 obtained from ATCC was used to streak a plate of YM media (10 g mannitol, 0.5 g K₂HPO₄, 0.2 g MgSO₄·7H₂O, 1.0 g yeast extract, 0.2 g NaCl, 88 mg FeCl₃·6H₂O, 15 g agar per 1 L). This plate was grown for two days at 30° C. A single colony was used to inoculate 1 liter of YM media. This culture was grown overnight at 30° C. The cell pellet was spun down at 5,000×g for 10 minutes and frozen at −20° C. The Qiagen Genomic-tip DNA kit (Qiagen Inc., Valencia, Calif.) was used to extract genomic DNA according to the August 1999 Qiagen Genomic DNA Handbook (p. 42).

A PCR reaction was used to amplify the gene. The primers used were Rhizo F2: ATGGCAGCGGTAATTCTGGAAG (SEQ ID NO: 138) and Rhizo R8: TCAGGCTGCCTTGGTCTTC (SEQ ID NO: 139). The resulting PCR fragment was cloned into the pGEM (Promega Corp., Madison, Wis.) vector.

Finally, the PCR product in pGEM was amplified using PCR with the following primers: Rhizo NcoI ACTGACTCCATGGCAGCGGTAATTCTGGAA (SEQ ID NO: 140) and RhizoSpeI: CTGACTAGTTCAGGCTGCTT (SEQ ID NO: 141) and the product was cloned into TOPO 2.1 PCR vector (Invitrogen Corp., Grand Island, N.Y.).

Vector Construction

The vector containing the Rhizobium gene in the TOPO 2.1 vector was digested with SpeI and a Klenow reaction was performed to blunt the site. The DNA was PCR purified (Qiagen PCR purification kit and MinElute Handbook, 2001) and then digested with NcoI. This fragment was cloned into pET30a at the EcoRV and NcoI sites, creating pMON66595 (FIG. 21).

The Arabidopsis transformation vector was created in several steps by first digesting pMON13773 with NcoI/EcoRI to generate a backbone piece. pMON66595 was digested with NcoI and EcoRI and the larger portion of the Rhizobium AS gene (approximately 2000 base pairs) was removed. The two pieces were then ligated together. A positive clone was digested with EcoRI and treated with calf intestinal phosphatase (CIP). The second fragment of the Rhizobium gene was removed by digesting pMON66595 with EcoRI and keeping the approximately 200 base pair piece. The two fragments were ligated together and the resultant clones were sequenced to check for correct orientation of the small Rhizobium fragment.

One of the above clones, having correct sequence and orientation, was digested with BglII and NcoI. Then pMON58046 (FIG. 5) was digested with BglII and NcoI and the CTP2 transit peptide fragment removed. These fragments were ligated together to complete the cassette.

The ligated fragments were digested with NotI to remove the cassette. They were then ligated into pMON36524 at the CIP treated NotI site. These clones were digested to find a cassette inserted in the same orientation as the NPTII gene creating pMON66599.

The vector pMON66599 was then transformed into Agrobacterium and used to transform Arabidopsis. The control construct in this experiment was pMON66598, which is the same cassette insert as described for pMON66599, except containing the Agrobacterium AS wild type gene.

Complementation Assay

The Rhizobium AS gene from pMON66595 was excised with BamHI and NcoI and cloned into the corresponding sites of the pSE280 vector (Invitrogen), creating pMON66596. pMON66596 was then transformed into a mutant E. coli strain, EMG2 ΔtrpE (created from the EMG strain WT K12, F+, which was obtained from ATCC) showing that the homomeric gene complements the genome of the trp-strain.

Activity Assay

pMON66596 was transformed into BL-21 cells and induced with IPTG to express protein. The crude cell extract was assayed by HPLC and found to have activity and an IC₅₀ around 10 μM trp (Poulson, 1991).

Protein Expression and Purification

His-tagged protein was expressed by inducing pMON66595 with IPTG. Protein purification was completed using native conditions outlined in the QIAexpressionist 2002 Handbook and nickel resin (Ni-NTA Spin Handbook, 2000). Rabbit sera show recognition of purified protein.

Plants are transformed with the vector containing the Rhizobium anthranilate synthase gene, as in Examples 2, 3, and 6, and show elevated levels of tryptophan in the seed.

Example 12 Corn Transformation with a Vector Containing a Maize Anthranilate Synthase α-Subunit Gene and a Maize Anthranilate Synthase β-Subunit Gene

To create a shuttle vector containing the coding sequence for maize anthranilate synthase α-subunit, pMON65150 was digested with both EcoRI and SacII. The resulting 6195 base pair fragment was gel purified and then dephosphorylated. The plasmid pMON66604 was digested with both EcoRI and SacII to generate a 1077 base pair fragment that was gel purified. The 1077 base pair fragment was then ligated into the sticky-ended 6195 base pair fragment containing the maize ASα coding sequence to generate pMON67149, a maize L3 (oleosin) promoter-maize hsp70 intron-maizeASα-Tr7 3′ UTR expression vector. This vector was subsequently digested with XhoI, resulting in a 4364 base pair DNA fragment containing the maize oleosin promoter, maize heat shock protein 70 intron, maize anthranilate synthase α-subunit coding sequence, and the Tr7 3′ UTR.

The plasmid pMON30167 was digested with XhoI, resulting in an 8.89 Kb DNA fragment. This fragment was then ligated to the 4364 base pair fragment to create pMON79951, a transformation vector containing L3 promoter-zmhsp70 intron-maizeASα-Tr7 3′ UTR.

The coding sequence for a maize anthranilate synthase β-subunit was isolated by PCR from a Monsanto proprietary cDNA library, using the following primers:

5′ primer 5′TGCTGACCATGGCCTGCTCCCACATCGTCG3′ (SEQ ID NO: 142), which contains the NcoI restriction site (shown in bold), and

3′ primer 5′CAGTGAATTCCTACGAACGCTGCTTCTCCAGTTC3′ (SEQ ID NO: 143), which contains the EcoRI restriction site (shown in bold).
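Because the bold highlighting of the restriction sites does not survive in plain text, the following minimal sketch locates the sites in the two primers. The recognition sequences (CCATGG for NcoI, GAATTC for EcoRI) are the standard ones for these enzymes; the function names and the bracket notation are illustrative choices, not part of the described method.

```python
# Standard recognition sequences for the two enzymes named in the text.
SITES = {"NcoI": "CCATGG", "EcoRI": "GAATTC"}

# Primer sequences as given for SEQ ID NO: 142 and SEQ ID NO: 143.
PRIMERS = {
    "SEQ ID NO: 142": ("TGCTGACCATGGCCTGCTCCCACATCGTCG", "NcoI"),
    "SEQ ID NO: 143": ("CAGTGAATTCCTACGAACGCTGCTTCTCCAGTTC", "EcoRI"),
}


def find_site(primer: str, site: str) -> int:
    """Return the 0-based index of the first occurrence of site, or -1."""
    return primer.upper().find(site.upper())


def mark_site(primer: str, site: str) -> str:
    """Return the primer with the first occurrence of site in [brackets]."""
    i = find_site(primer, site)
    if i < 0:
        return primer
    end = i + len(site)
    return primer[:i] + "[" + primer[i:end] + "]" + primer[end:]


if __name__ == "__main__":
    for name, (seq, enzyme) in PRIMERS.items():
        print(name, enzyme, mark_site(seq, SITES[enzyme]))
```

Run as a script, this prints each primer with its restriction site bracketed, which stands in for the bold type of the original.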

The PCR parameters used were 2°/second to 95°; 95° for 5 minutes; 95° for 30 seconds; 2°/second to 55°; 55° for 45 seconds; 2°/second to 72°; 72° for 55 seconds; cycle 25 times starting at the third step; 2°/second to 72°; 72° for 10 minutes; 2°/second to 4° forever (all temperatures shown are in ° C., unless otherwise noted). The PCR mix contained: 1 μl of miniprep (Qiagen method) DNA from pMON79952, 1.5 μl of each primer (10 μM stock), 5 μl of Roche 10×PCR buffer with magnesium chloride, 2 μl 10 mM dNTP mix, 1 μl Hi-Fi Taq mix (Roche Expand High Fidelity PCR System # 1732650) and water to a total volume of 50 μl. The resulting PCR product was ligated into the pGEM-T vector (Promega pGEM-T Vector System I #A3600). Sequencing the above-described fragment revealed that it was missing restriction sites. The PCR was performed again using the pGEM clone as a template and the product cloned into the TOPO2.1 PCR Vector (Invitrogen TOPO TA Cloning Kit pCR 2.1-TOPO vector #45-0641). This clone was confirmed by sequencing. The resulting vector, containing maize ASβ, was named pMON66592. This vector was then digested with both NcoI and EcoRI to generate an 850 bp fragment. After isolation, the ends of the 850 bp fragment were made blunt and dephosphorylated. The plasmid pMON79953 was digested with BamHI and SmaI to generate a 5353 bp fragment. After isolation, the ends of the 5353 bp fragment were made blunt. The 850 bp fragment containing the ASβ gene was then ligated into the blunt-ended 5353 bp pMON79953 fragment to generate pMON79954, a maize L3 promoter-maize hsp70 intron-maizeASβ-Tr7 3′ UTR vector.
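The reaction mix stated above can be checked with a short calculation. The sketch below works out the water volume needed to reach 50 μl and the final concentrations of the components whose stock concentrations are given; the variable and function names are illustrative, and the template and enzyme concentrations are left out because the text does not state them.

```python
TOTAL_UL = 50.0

# Component volumes in microliters, as listed for the PCR mix.
components_ul = {
    "template DNA (miniprep)": 1.0,
    "5' primer (10 uM stock)": 1.5,
    "3' primer (10 uM stock)": 1.5,
    "10x PCR buffer with MgCl2": 5.0,
    "10 mM dNTP mix": 2.0,
    "Hi-Fi Taq mix": 1.0,
}

# Water makes up the remainder of the 50 ul reaction.
water_ul = TOTAL_UL - sum(components_ul.values())


def final_conc(stock: float, volume_ul: float, total_ul: float = TOTAL_UL) -> float:
    """Dilution formula C_final = C_stock * V_added / V_total (same units as stock)."""
    return stock * volume_ul / total_ul


primer_final_uM = final_conc(10.0, 1.5)  # each primer, in uM
dntp_final_mM = final_conc(10.0, 2.0)    # dNTP mix, in mM
buffer_final_x = final_conc(10.0, 5.0)   # buffer, as a multiple of 1x

if __name__ == "__main__":
    print(f"water: {water_ul} ul")
    print(f"each primer: {primer_final_uM} uM")
    print(f"dNTP mix: {dntp_final_mM} mM")
    print(f"buffer: {buffer_final_x}x")
```

Under these volumes the listed components total 12 μl, so 38 μl of water completes the reaction, and each primer ends up at 0.3 μM in 1× buffer.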

The vector pMON79954 was subsequently digested with XhoI to generate a 2907 base pair DNA fragment containing the maize oleosin promoter, maize hsp70 intron, maize ASβ coding sequence, and the Tr7 3′ UTR. The plasmid pMON30167 was digested with XhoI to generate an 8.89 Kb fragment. The two fragments were ligated together to generate pMON79955, a transformation vector containing L3 promoter-zmhsp70 intron-maizeASβ-Tr7 3′ UTR.
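As a rough bookkeeping aid for the ASβ constructs, the sketch below estimates vector sizes on the assumption that a ligation product is simply the sum of its fragment lengths. This ignores the few bases gained or lost when ends are made blunt, and it treats the rounded 8.89 Kb figure as 8890 bp, so the results are approximate. The helper name is hypothetical.

```python
def ligation_size_bp(*fragments_bp: int) -> int:
    """Approximate size of a ligation product as the sum of its fragments."""
    return sum(fragments_bp)


# pMON79954: 5353 bp pMON79953 fragment plus the 850 bp ASbeta fragment.
pmon79954_bp = ligation_size_bp(5353, 850)

# pMON79955: 8.89 Kb pMON30167 fragment (taken as 8890 bp) plus the
# 2907 bp XhoI cassette from pMON79954.
pmon79955_bp = ligation_size_bp(8890, 2907)

if __name__ == "__main__":
    print("pMON79954 ~", pmon79954_bp, "bp")
    print("pMON79955 ~", pmon79955_bp, "bp")
```

Estimates of this kind are mainly useful for predicting the band pattern expected when a candidate clone is checked by restriction digestion.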

To create the maize ASα and ASβ stacking vector, pMON79955 was digested with HindIII to generate a 16.57 Kb fragment. The fragment was made blunt and dephosphorylated. The plasmid pMON67149 was digested with XhoI to generate a 4364 base pair DNA fragment, which was subsequently blunt ended. The two fragments were ligated together to create pMON79956, a final transformation vector containing maize L3 promoter-maize hsp70 intron-maizeASα-Tr7 3′ UTR stacked with maize L3 promoter-maize hsp70 intron-maizeASβ-Tr7 3′ UTR.

Maize Transformation Using Agrobacterium tumefaciens

Maize plants (inbred line LH198/Hi11) are grown in a greenhouse under standard practices. The ears of the plants are harvested when the embryos are 1.5 to 2.0 mm in length, usually 10-15 days after pollination. The ears are surface sterilized by spraying or soaking in 80% ethanol.

The immature embryos are isolated from individual kernels using methods known to those of skill in the art. Immature embryos are cultured on medium 211 (N6 salts, 2% sucrose, 1 mg/L 2,4-dichlorophenoxyacetic acid (2,4-D), 0.5 mg/L niacin, 1.0 mg/L thiamine-HCl, 0.91 g/L L-asparagine, 100 mg/L myo-inositol, 0.5 g/L MES, 100 mg/L casein hydrolysate, 1.6 g/L MgCl2, 0.69 g/L L-proline, 2 g/L GELGRO™, pH 5.8) containing 16.9 mg/L AgNO3 (designated medium 2112V) for 3-6 days prior to transformation.

Methods of Agrobacterium mediated transformation of maize cells and other monocots are known (U.S. Pat. Nos. 5,591,616 and 5,981,840; and EP 0 672 752). The Agrobacterium strain ABI, and an Agrobacterium tumefaciens binary vector system are used for the transformations.

Prior to co-culture with the maize embryo cells, Agrobacterium cells are grown at 28° C. in LB (DIFCO) liquid medium containing approximately 50 μg/ml kanamycin and 100 μg/ml spectinomycin to select for maintenance of the modified Ti plasmid and binary vector. Prior to inoculation of maize cells the Agrobacterium cells are grown overnight at room temperature in AB medium (Chilton et al., 1974) comprising appropriate antibiotics for plasmid maintenance and 200 μM acetosyringone. Immediately prior to inoculation the Agrobacterium cells are pelleted by centrifugation and washed in ½ MSVI medium (2.2 g/L GIBCO MS (Murashige and Skoog, 1962) basal salts, 2 mg/L glycine, 0.5 g/L niacin, 0.5 g/L L-pyridoxin-HCl, 0.1 mg/L thiamine, 115 g/L L-proline, 10 g/L D-glucose, and 10 g/L sucrose, pH 5.4) containing 200 μM acetosyringone.

The immature maize embryos are excised, immersed in an Agrobacterium suspension in ½ MSPL medium and incubated at room temperature with Agrobacterium for approximately 5 minutes.

Following Agrobacterium infection and co-culture, the embryos are transferred to type II delay medium for 5 to 7 days and cultured at 27° C. in the dark. The delay medium consists of MS basal salts containing 2.0 mg/L 2,4-D (GIBCO), 100 mg/L casamino acids, 12 mM proline, 500 mg/L carbenicillin and 20 μM silver thiosulfate. All media chemicals were tissue culture grade. Once signs of type II callus initiation from immature embryos are observed, as defined by Selman et al. (1994), the coleoptiles are removed from the embryos. The embryos are then transferred to MS medium containing 2.0 mg/L 2,4-D, 12 mM proline, 20 μM silver thiosulfate, 500 mg/L carbenicillin and 0.5 mM glyphosate (Monsanto Company, St. Louis, Mo.) and incubated at 27° C. in the dark for 2 weeks.

Embryos forming callus are transferred to the MS medium described above, but additionally containing 1.0 mM glyphosate. The cultures are then incubated for 2 weeks in the dark at 27° C. The embryos still having callus are then transferred to MS medium containing 3.0 mM glyphosate for an additional 2 weeks.
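The selection media above give proline and glyphosate as molar concentrations, while most other components are given by mass. The sketch below converts between the two, assuming molecular weights of 169.07 g/mol for glyphosate (free acid) and 115.13 g/mol for L-proline. These values are standard reference figures, not taken from the text, and a salt form of glyphosate would need a different molecular weight.

```python
# Assumed molecular weights in g/mol (free acid / free amino acid forms).
MW_G_PER_MOL = {
    "glyphosate": 169.07,
    "L-proline": 115.13,
}


def mM_to_mg_per_L(conc_mM: float, mw_g_per_mol: float) -> float:
    """Convert mmol/L to mg/L: (mmol/L) * (mg/mmol) = mg/L."""
    return conc_mM * mw_g_per_mol


# Stepwise glyphosate selection levels named in the text, in mM.
GLYPHOSATE_STEPS_mM = [0.5, 1.0, 3.0]

if __name__ == "__main__":
    for step in GLYPHOSATE_STEPS_mM:
        mg = mM_to_mg_per_L(step, MW_G_PER_MOL["glyphosate"])
        print(f"{step} mM glyphosate ~ {mg:.1f} mg/L")
    proline = mM_to_mg_per_L(12.0, MW_G_PER_MOL["L-proline"])
    print(f"12 mM proline ~ {proline:.1f} mg/L")
```

With these assumptions, the three selection steps correspond to roughly 85, 169 and 507 mg/L glyphosate, and 12 mM proline to roughly 1.4 g/L.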

Plant regeneration is achieved by transferring the callus to MS medium containing 0.1 mg/L 2,4-D and 0.1 μM abscisic acid (ABA) for 2 weeks and then to MS medium containing 6% sucrose and no 2,4-D for another 2 weeks. Both incubations are done in the dark at 27° C. to permit somatic embryo maturation and conversion in the regeneration process.

Somatic embryos that are ready to germinate are transferred to hormone-free MS medium, and incubated in the light until shoots with attached roots are produced. After approximately 2 to 3 weeks, plantlets are produced.

Plantlets are then transferred to the greenhouse and grown under standard greenhouse conditions.

Analysis of Amino Acid Content of R1 Seed

Several transgenic corn lines were established for each vector and propagated through a number of generations. These lines are grown and self-pollinated to generate homozygous lines. At each generation, expression of the transgenes is determined using western blot analysis on immature seed, and mature R1 seed is produced and analyzed for free amino acid content using fluorescence detection as described in Agilent Technologies Technical Bulletin REV14. Maize seeds expressing ASα protein generate elevated amounts of tryptophan relative to baseline levels (corresponding to negative isolines and nontransgenic controls). Baseline free tryptophan levels for corn range from about 5 to about 25 ppm.

Example 13 Production of Meal and Feed Products Containing Codon-Optimized AgroAS

Any of the plants or parts thereof of the present invention may be processed to produce a feed, meal, protein, or oil preparation, including feed, meal, protein and oil preparations high in total tryptophan content. A particularly preferred plant part for this purpose is a seed. In a preferred embodiment the feed, meal, protein or oil preparation is designed for livestock animals or humans, or both. Methods to produce feed, meal, protein and oil preparations are known in the art. See, for example, U.S. Pat. Nos. 4,957,748; 5,100,679; 5,219,596; 5,936,069; 6,005,076; 6,146,669; and 6,156,227. In a preferred embodiment, the feed, meal, protein or oil preparation is a high tryptophan preparation. Such a high tryptophan preparation preferably has a tryptophan content of greater than about 200 to 400 ppm, more preferably 400 to 600 ppm, and even more preferably 600 to 800 ppm.

Example 14 Detection of Unique Polynucleotide Sequences in Meal, Feed, Protein and Oil Products

This example describes methods that can be used to detect the presence of unique fragments of an artificial polynucleotide sequence of the present invention in meal, feed, protein and oil products. Methods for comparing a known sequence to other known sequences in order to determine which parts of the known sequence are unique (e.g. less than 100% identical) to other known sequences are well known to those of skill in the art. It has been determined by using a BLAST search of all known corn DNA sequences that the polynucleotides of the present invention are unique to at least the 21 mer level. That is to say that the polynucleotides of the present invention contain no segments of consecutive nucleotides greater than 20 nucleotide bases that are a 100% identical match to any other known corn DNA sequence. Additionally, a comparison of the polynucleotides of the present invention with all known anthranilate synthase sequences using a BLAST search revealed that there are no other anthranilate synthase sequences that match 100% to the claimed polynucleotide sequences for more than 32 consecutive nucleotides. That is to say that the polynucleotides of the present invention, when compared to all known corn DNA sequences and all known anthranilate synthase sequences, are unique to at least the 33 mer level. Sequences that are unique to the claimed invention may be measured by methods well-known to those of skill in the art including, but not limited to hybridization by the Southern blot method.
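The notion of uniqueness "to the k-mer level" used above can be illustrated with a small sketch: a query sequence is unique at level k relative to a set of reference sequences if the longest run of consecutive bases it shares exactly with any reference is shorter than k. The code below is an illustration only, not the BLAST procedure described in the text; it compares one strand, uses short made-up toy sequences, and all function names are hypothetical.

```python
def longest_shared_run(query: str, reference: str) -> int:
    """Length of the longest stretch of consecutive identical bases
    present in both sequences (longest common substring, by dynamic
    programming over one row at a time)."""
    best = 0
    prev = [0] * (len(reference) + 1)
    for q in query:
        cur = [0] * (len(reference) + 1)
        for j, r in enumerate(reference, start=1):
            if q == r:
                cur[j] = prev[j - 1] + 1
                if cur[j] > best:
                    best = cur[j]
        prev = cur
    return best


def unique_at_kmer_level(query: str, references: list[str], k: int) -> bool:
    """True if no stretch of k consecutive bases of query matches any reference exactly."""
    return all(longest_shared_run(query, ref) < k for ref in references)


if __name__ == "__main__":
    # Toy sequences, invented for illustration only.
    query = "ACGTACGTTT"
    reference = "GGACGTACCC"
    print(longest_shared_run(query, reference))
    print(unique_at_kmer_level(query, [reference], 7))
```

In these terms, a longest exact match of at most 20 bases against corn sequences corresponds to uniqueness at the 21-mer level, and a longest match of at most 32 bases corresponds to the 33-mer level. A practical check would also compare against the reverse complement of each reference.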

Isolation of Genomic DNA

Genomic DNA from test and control samples can be extracted using standard extraction protocols for nucleic acids. Briefly, 200 to 500 mg of corn meal or seed powder is transferred to 10 ml conical tubes; approximately 10 ml of CTAB extraction buffer (1.5% (w/v) CTAB, 75 mM Tris-HCl pH 8.0, 100 mM EDTA pH 8.0, 1.05 M NaCl, and 0.75% (w/v) PVP (MW 40,000)) is added to the tube. The samples are incubated at 65° to 70° C. for 60 minutes with intermittent mixing. Approximately 12 ml of chloroform is added to the samples. The suspensions are inverted for 5 minutes by hand, and then the samples are aliquotted into 10 ml polypropylene tubes (3/sample). The samples are centrifuged at ˜10,000×g for 5 to 10 minutes at room temperature. The aqueous (upper) layer is transferred to a clean 10 ml polypropylene tube and the chloroform extraction as above is repeated with 1 volume of chloroform. The aqueous layers from same event samples are combined in a single 50 ml conical tube and an equal volume of 95% EtOH is added. The precipitated DNA is pooled into a 10 ml polypropylene tube containing 5 ml 70% ethanol, and centrifuged at ˜10,000 rpm for 2-3 minutes to pellet the DNA. The pellet is transferred to a microfuge tube containing 1 ml 70% EtOH and spun at 14,000 rpm in a microcentrifuge for 1 minute. The DNA is dissolved in H₂O, and the DNA is quantified using Sigma's DNA Quantitation kit (catalog #DNA-QF) (or equivalent) and a Hoefer DyNA Quant 200 Fluorometer (or equivalent).
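The protocol above gives some centrifugation steps in ×g and others in rpm. The two are related through the rotor radius by the standard formula RCF = 1.118 × 10⁻⁵ × r × rpm², with r in centimeters. The sketch below applies it; the 8 cm radius is a hypothetical value chosen for illustration, since the text does not name a rotor.

```python
import math

RCF_CONSTANT = 1.118e-5  # standard factor for r in cm and speed in rpm


def rcf_from_rpm(rpm: float, radius_cm: float) -> float:
    """Relative centrifugal force (x g) for a given speed and rotor radius."""
    return RCF_CONSTANT * radius_cm * rpm ** 2


def rpm_from_rcf(rcf: float, radius_cm: float) -> float:
    """Rotor speed (rpm) needed to reach a given relative centrifugal force."""
    return math.sqrt(rcf / (RCF_CONSTANT * radius_cm))


if __name__ == "__main__":
    radius_cm = 8.0  # hypothetical rotor radius, not from the text
    print(f"10,000 rpm at {radius_cm} cm ~ {rcf_from_rpm(10_000, radius_cm):.0f} x g")
    print(f"10,000 x g at {radius_cm} cm ~ {rpm_from_rcf(10_000, radius_cm):.0f} rpm")
```

For the assumed 8 cm rotor, 10,000 rpm corresponds to about 8,944 × g, so the rpm and ×g figures are close but not interchangeable unless the rotor is known.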

Restriction Enzyme Digestion of Genomic DNA

Approximately 10 μg of genomic DNA from a test substance and ˜10 μg of genomic DNA from a control substance can be used in independent restriction enzyme digestions. Overnight digests are carried out at 37° C. (60° C. for BstEII) in a total volume of 300-500 μl using 100 units of the appropriate restriction enzyme(s) and 50 ng DNase-free RNase (Roche or equivalent). After digestion, DNA is precipitated from the samples by adding 2-2.5 volumes of 100% ethanol followed by incubation in a −20° C. freezer for at least 1 hour and centrifugation for 20 minutes in a microcentrifuge. The DNA is redissolved in water.

DNA Probe Preparation for Southern Blot Analysis

Probe template DNA containing sequences of plasmid pMON68066 is prepared by standard PCR amplification. Approximately 25 ng of each probe template can be used to label with ³²P-dCTP (˜6000 Ci/mmol) by a random priming method (RadPrime DNA Labeling System, Life Technologies, Gaithersburg, Md.). The labeled probes are purified with Roche G-50 columns per the manufacturer's protocol. One microliter of each labeled probe is counted, and 1.0-1.5×10⁶ cpm/ml hybridization buffer can be used per blot.
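The volume of labeled probe to add to a blot follows directly from the count of the 1 μl sample. The sketch below computes it for the stated target of 1.0-1.5×10⁶ cpm/ml; the example count (5×10⁵ cpm/μl) and buffer volume (20 ml) are hypothetical values chosen for illustration.

```python
def probe_volume_ul(cpm_per_ul: float, buffer_ml: float, target_cpm_per_ml: float) -> float:
    """Microliters of labeled probe needed so that the hybridization
    buffer reaches the target activity (cpm per ml)."""
    if cpm_per_ul <= 0:
        raise ValueError("cpm_per_ul must be positive")
    return target_cpm_per_ml * buffer_ml / cpm_per_ul


if __name__ == "__main__":
    measured_cpm_per_ul = 5.0e5  # hypothetical count of the 1 ul sample
    buffer_ml = 20.0             # hypothetical hybridization buffer volume
    low = probe_volume_ul(measured_cpm_per_ul, buffer_ml, 1.0e6)
    high = probe_volume_ul(measured_cpm_per_ul, buffer_ml, 1.5e6)
    print(f"add between {low:.0f} and {high:.0f} ul of probe")
```

With these example numbers, 40 to 60 μl of probe would bring 20 ml of buffer into the stated range.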

Southern Blot Analysis of Genomic DNA

Separation of the samples of DNA digested with restriction enzymes is based on size using 0.8% (w/v) agarose gel electrophoresis in 1×TBE buffer. The gels are depurinated in a 0.25 M HCl solution for 20 minutes, denatured for 30 minutes in 0.4 N NaOH, 0.6 M NaCl and neutralized in 1.5 M NaCl, 0.5 M Tris Base pH 7.0 for 30 minutes. Treated gels are transferred to pretreated GeneScreen Plus® membranes for 4-20 hrs in 10×SSC using a Turbo Blotter. The membranes are crosslinked in a UV Stratalinker® 2400 (Stratagene, La Jolla, Calif.) at 120 mjoules. The membranes are prehybridized in 10% Dextran Sulfate solution (10% Dextran Sulfate, 1% SDS, 1 M NaCl, 100 μg/ml denatured, single stranded fish sperm DNA) for 1 hr at 65° C. The prehybridization solution is replaced with a hybridization solution containing 1.0-1.5×10⁶ cpm/ml denatured probe in a 10% Dextran Sulfate solution. Hybridization is performed at 65° C. for 15 to 20 hrs. The blots are washed at 68° C. in varying concentrations of SSC, SDS solutions, and exposed to Kodak BIOMAX MS film (Kodak, Rochester, N.Y.) in conjunction with one Kodak BIOMAX MS intensifying screen in a −80° C. freezer.

All publications and patents are incorporated by reference herein, as though individually incorporated by reference. The invention is not limited to the exact details shown and described, for it should be understood that many variations and modifications may be made while remaining within the spirit and scope of the invention defined by the claims.

REFERENCES U.S. Patents

-   U.S. Pat. No. 5,500,365; U.S. Pat. No. 4,581,847; U.S. Pat. No. 4,940,838; U.S. Pat. No. 4,957,748; U.S. Pat. No. 5,100,679; U.S. Pat. No. 5,219,596; U.S. Pat. No. 5,258,300; U.S. Pat. No. 5,290,924; U.S. Pat. No. 5,302,523; U.S. Pat. No. 5,384,253; U.S. Pat. No. 5,489,520; U.S. Pat. No. 5,550,318; U.S. Pat. No. 5,591,616; U.S. Pat. No. 5,936,069; U.S. Pat. No. 5,981,840; U.S. Pat. No. 5,990,390; U.S. Pat. No. 6,005,076; U.S. Pat. No. 6,069,289; U.S. Pat. No. 6,118,047; U.S. Pat. No. 6,146,669; U.S. Pat. No. 6,156,227; U.S. Pat. No. 6,160,208; U.S. Pat. No. 6,268,550; U.S. Pat. No. 6,281,411; U.S. Pat. No. 6,303,364; U.S. Pat. No. 6,320,100; U.S. Pat. No. 6,329,574; U.S. Pat. No. 6,331,655; U.S. Pat. No. 6,331,880; U.S. Pat. No. 6,433,252
-   U.S. patent application Ser. No. 10/430,011
-   U.S. Patent Publn. 20050005327
-   Adang et al., Gene, 36:289, 1985.
-   Agilent Technical Publication, “Amino Acid Analysis Using Zorbax Eclipse-AAA Columns and the Agilent 1100 HPLC”, Mar. 17, 2000.
-   Altschul et al., Nucleic Acid Res., 25:3389-3402, 1997.
-   An, Methods in Enzymology, 153:292, 1987.
-   Bae et al., J. Bacteriol., 171:3471-3478, 1989.
-   Barnes et al., Adv. Exp. Med. Biol., 401:87, 1996.
-   Beachy et al., EMBO J, 4:3047, 1985.
-   Berg et al., Biotechnology, 1:417, 1983.
-   Bevan et al., Nucl. Acid Res., 11:369, 1983.
-   Bock, J. Mol. Biol., 312:425-438, 2001.
-   Bohlmann et al., Plant Phys., 111:507-514, 1996.
-   Bouchez et al., EMBO J, 8:4197, 1989.
-   Bowler et al., Ann. Rev. Plant Physiol., 43:83, 1992.
-   Broadbent et al., Curr. Med. Chem., 5:469, 1998.
-   Caligiuri and Bauerle, J. Biol. Chem., 266:8328-8335, 1991.
-   Callis et al., Genes Develop., 1:1183, 1987.
-   Campbell, W. C., Ed., Ivermectin and Abamectin. Springer Verlag, New York, N.Y., 1989.
-   Chandler et al., The Plant Cell, 1:1175, 1989.
-   Chilton et al., Proc. Nat. Acad. Sci. USA, 71:3672-3676, 1974.
-   Conkling et al., Plant Physiol., 93:1203, 1990.
-   Cutler et al., J. Plant Physiol., 135:351, 1989.
-   Czapla and Lang, J. Econ. Entomol., 83:2480, 1990.
-   Daniell et al., Trends in Plant Science, 7:84-91, 2002.
-   Dayhoff, In: Atlas of Protein Sequence and Structure, Natl. Biomed. Res. Found., 5:101-110; 5(2):1-10, 1972.
-   De Troch et al., Curr. Microbiol., 34:27-32, 1997.
-   Dean et al., Nucleic Acids Res., 14:2229-2240, 1986.
-   Dellaporta et al., In: Chromosome Structure and Function: Impact of New Concepts, 18th Stadler Genetics Symposium, 11:263-282, 1988.
-   D'Halluin et al., The Plant Cell, 4:1495, 1992.
-   Dure et al., Plant Molecular Biology, 12:475, 1989.
-   Ebert et al., Proc. Nat. Acad. Sci. USA, 84:5745, 1987.
-   Ellis et al., EMBO J., 6:3203, 1987.
-   EP 0 672 752
-   EP 154 204
-   Frey et al., Proc. Natl. Acad. Sci. USA, 97:14801, 2000.
-   Fromm et al., Bio/Technology, 8:833, 1990.
-   Fromm et al., The Plant Cell, 1:977, 1989.
-   Gatehouse et al., J. Sci. Food Agric., 35:373, 1984.
-   Geiser et al., Gene, 48:109, 1986.
-   Goodall and Filipowich, Cell, 58:473-483, 1989.
-   Goodner et al., Science, 294:2323-2328, 2001.
-   Gordon-Kamm et al., Plant Cell, 2:603, 1990.
-   Gupta et al., Proc. Natl. Acad. Sci. USA, 90:1629, 1993.
-   Hajdukiewicz et al., EMBO J, 16:4041-4048, 1997.
-   Hammock et al., Nature, 344:458, 1990.
-   Hankins et al., Plant Physiol., 57:101, 1976.
-   Heifetz, Biochimie, 82:655-666, 2000.
-   Heijne et al., Eur. J. Biochem., 180:535, 1989.
-   Hilder et al., Nature, 330:160, 1987.
-   Hinchee et al., Biotech., 6:915, 1988.
-   Hofte et al., Microbiol. Rev., 53:242, 1989.
-   Hudspeth et al., Plant Mol. Biol., 12:579, 1989.
-   Ikeda et al., J. Bacteriol., 169:5615, 1987.
-   Ikuta et al., Biotech., 8:241, 1990.
-   Jones et al., Analyst, 106:968, 1981.
-   Joshi, Nucl. Acid Res., 15:6643, 1987.
-   Kaasen et al., J. Bacteriology, 174:889, 1992.
-   Katz et al., J. Gen. Microbiol., 129:2703, 1983.
-   Keegstra et al., Ann. Rev. Plant Physiol. Plant Mol. Biol., 40:471, 1989.
-   Knochel et al., Proc. Natl. Acad. Sci. USA, 96:9479-9484, 1999.
-   Laursen et al., Plant Mol. Biol., 24:51, 1994.
-   Lawton et al., Plant Mol. Biol., 9:31F, 1987.
-   Lee and Saier, J. Bacteriol., 258:10761, 1982.
-   Levings, Science, 250:942-947, 1990.
-   Li and Last, Plant Physiol., 110:51-59, 1996.
-   Link et al., J. Bacteriol., 179:6228, 1997.
-   Loomis et al., J. Expt. Zoology, 252:9, 1989.
-   Maliga, Current Opinion in Plant Biology, 5:164-172, 2002.
-   Mantegani et al., Farmaco, 54:288, 1999.
-   Mash et al., Ann. NY Acad. Sci., 844:274, 1998.
-   McCown & Lloyd, Proc. International Plant Propagation Soc., 30:421, 1981.
-   Melanson et al., Proc. Natl. Acad. Sci. USA, 94:13345, 1997.
-   Mitra et al., Molec. Gen. Genetic., 215:294, 1989.
-   Muller et al., Biol. Chem., 381:679, 2000.
-   Murakami et al., Mol. Gen. Genet., 205:42, 1986.
-   Murashige and Skoog, Physiol Plant, 15:473-497, 1962.
-   Murdock et al., Phytochemistry, 29:85, 1990.
-   Murray et al., Nucl. Acids Res., 17:477, 1989.
-   Niedz et al., Plant Cell Reports, 14:403, 1995.
-   Niyogi and Fink, Plant Cell, 4:721, 1992.
-   Niyogi et al., Plant Cell, 5:1011, 1993.
-   Odell et al., Nature, 313:810, 1985.
-   Ow et al., Science, 234:856, 1986.
-   Paulsen, J. Chromatogr., 547:155-160, 1991.
-   PCT Appln. PCT/CA99/00219
-   PCT Appln. WO 2004/009761A2
-   Perlak et al., Proc. Natl. Acad. Sci. USA, 88:3324-3328, 1991.
-   Potrykus et al., Mol. Gen. Genet., 199:183, 1985.
-   Poulson, J. Chromatogr., 547:155-160, 1991.
-   Prasher et al., Biochem. Biophys. Res. Comm., 126:1259, 1985.
-   Ranch et al., Plant Physiol., 71:136, 1983.
-   Sambrook and Russell, Molecular Cloning: A Laboratory Manual, 3rd Edition, 2001.
-   Sambrook et al., Molecular Cloning: A Laboratory Manual, 2nd Edition, 1989.
-   Selman et al., In: The Maize Handbook, Freeling and Walbot (Eds.), Springer Verlag, 672, 1994.
-   Siehl, D., pp 171-204 In: Plant Amino Acids: Biochemistry and Biotechnology, BK Singh (Ed.), Marcel Dekker, 1999.
-   Slater et al., J. Bact., 180:1979-1987, 1998.
-   Smith and Waterman, Adv. in Appl. Math., 2:482-489, 1981.
-   Smith et al., Nucl. Acids Res. 11:2205-2220, 1983.
-   Stalker et al., Science, 242:419, 1988.
-   Stark et al., Science, 258:287, 1992.
-   Sullivan et al., Mol. Gen. Genet., 215:431, 1989.
-   Sutcliffe, Proc. Nat. Acad. Sci. USA, 75:3737, 1978.
-   Thillet et al., J. Biol. Chem., 263:12500, 1988.
-   Twell et al., Plant Physiol., 91:1270, 1989.
-   Umbarger, Ann. Rev. Biochem., 47:555, 1978.
-   Vasil et al., Plant Physiol., 91:5175, 1989.
-   Walker et al., Proc. Nat. Acad. Sci. USA, 84:6624, 1987.
-   Walters et al., Plant Molecular Biology, 18:189, 1992.
-   Wang et al., Mol. Cell. Biol., 12:3399, 1992.
-   Whitely et al., Ann. Rev. Microbiol., 40:549, 1986.
-   Widholm, Biochim. Biophys. Acta, 320:217, 1973.
-   Widholm, Biochimica et Biophysica Acta, 279:48, 1972.
-   Wolter et al., The EMBO J., 11:4685, 1992.
-   Yang et al., Proc. Nat. Acad. Sci. USA, 87:4144, 1990.
-   Zeligs, J. Med. Food, 1:67, 1998.
-   Zukowsky et al., Proc. Nat. Acad. Sci. USA, 80:1101, 1983.

1. A polynucleotide molecule comprising a nucleotide sequence at least 84% identical to SEQ ID NO:145, wherein said polynucleotide molecule encodes a tryptophan feedback insensitive anthranilate synthase polypeptide comprising one or more amino acid substitutions at positions corresponding to position 48, 51, 52, 293, or 298 in wild-type Agrobacterium tumefaciens anthranilate synthase depicted in SEQ ID NO:4.
 2. The polynucleotide molecule of claim 1, wherein the nucleotide sequence comprises SEQ ID NO:145.
 3. The polynucleotide molecule of claim 1, wherein the tryptophan feedback insensitive anthranilate synthase polypeptide comprises the polypeptide sequence of SEQ ID NO:61.
 4. A DNA construct comprising the polynucleotide molecule of claim 1 operably linked to a promoter that functions in plants.
 5. A transgenic cell transformed with the construct of claim 4.
 6. The cell of claim 5, further defined as a plant cell.
 7. The transgenic cell of claim 6, wherein the plant cell is from a plant selected from the group consisting of corn, rice, wheat, barley, millet, oat, and sorghum.
 8. The transgenic cell of claim 7, wherein the cell is a corn cell.
 9. The transgenic cell of claim 5, wherein the polynucleotide molecule is expressed in the cell and confers an increased level of tryptophan to the cell relative to an isogenic cell lacking said molecule.
 10. A transgenic plant comprising the construct of claim 4.
 11. A transgenic plant produced from the cell of claim 5.
 12. The transgenic plant of claim 10, wherein the polynucleotide molecule is expressed in seeds of said plant and confers a nutritionally enhanced phenotype compared to isogenic plant seeds lacking said molecule.
 13. The transgenic plant of claim 12, wherein the nutritionally enhanced phenotype is selected from the group consisting of increased tryptophan, increased isoleucine, increased valine, increased arginine, increased lysine, increased methionine, and increased threonine.
 14. The transgenic plant of claim 12, wherein the transgenic plant is a plant selected from the group consisting of corn, rice, wheat, barley, millet, oat, and sorghum.
 15. The transgenic plant of claim 14, wherein the plant is a corn plant.
 16. Seed or silage comprising the plant cell of claim 6.
 17. A method of producing a nutritionally enhanced feed product, comprising processing the transgenic plant of claim 12 or a part thereof into meal, animal feed, protein or oil.
 18. A feed product produced by the method of claim 17, wherein the feed product comprises a detectable nucleic acid sequence comprising the polynucleotide molecule of claim 1 or a complement thereof.
 19. The feed product of claim 18, wherein the feed product is an animal feed or human food.
 20. A method of providing an increased level of tryptophan in the seed of a plant comprising: a) obtaining a DNA construct according to claim 4; b) introducing the DNA construct into a plant cell; c) regenerating said plant cell into a fertile transgenic plant; and d) obtaining at least one seed from the fertile transgenic plant or progeny thereof that comprises the DNA construct and exhibits increased tryptophan compared to a plant seed lacking said construct, wherein said increased tryptophan is a result of the expression of the DNA construct in the seed.
 21. A method of providing seed with an increased level of tryptophan comprising: a) introducing into a plant the construct of claim 4; and b) allowing the plant to produce seed expressing the construct; wherein said seed comprise an increased level of tryptophan compared to seed lacking said construct.
 22. A progeny plant produced from the seed of claim 21, wherein the progeny plant comprises said construct.
 23. A seed produced from the progeny plant of claim 22, wherein said seed comprises said construct.
 24. The method of claim 21, wherein the plant is selected from the group consisting of corn, rice, wheat, barley, millet, oat, and sorghum.
 25. The polynucleotide molecule of claim 1, wherein the tryptophan feedback insensitive anthranilate synthase polypeptide comprises an amino acid substitution at position 51.
 26. The polynucleotide molecule of claim 25, wherein the tryptophan feedback insensitive anthranilate synthase polypeptide comprises a Phe or Cys substitution at position 51.
 27. The polynucleotide sequence of claim 26, wherein the tryptophan feedback insensitive anthranilate synthase polypeptide comprises a Cys substitution at position 51.
 28. The polynucleotide molecule of claim 1, wherein the one or more amino acid substitution(s) are selected from the group consisting of: Phe or Tyr at position 48; Phe or Cys at position 51; Phe at position 52; Ala or Gly at position 293; and Trp at position 298.
 29. The transgenic plant of claim 10, wherein the tryptophan feedback insensitive anthranilate synthase polypeptide comprises the polypeptide sequence of SEQ ID NO:58; SEQ ID NO:59; SEQ ID NO:60; SEQ ID NO:61; SEQ ID NO:62; SEQ ID NO:63; SEQ ID NO:64; or SEQ ID NO:65.
 30. The polynucleotide molecule of claim 1, wherein the tryptophan feedback insensitive anthranilate synthase polypeptide comprises an amino acid substitution at a position corresponding to position 48 in wild-type Agrobacterium tumefaciens anthranilate synthase depicted in SEQ ID NO:4.
 31. The polynucleotide molecule of claim 1, wherein the tryptophan feedback insensitive anthranilate synthase polypeptide comprises an amino acid substitution at a position corresponding to position 51 in wild-type Agrobacterium tumefaciens anthranilate synthase depicted in SEQ ID NO:4.
 32. The polynucleotide molecule of claim 1, wherein the tryptophan feedback insensitive anthranilate synthase polypeptide comprises an amino acid substitution at a position corresponding to position 52 in wild-type Agrobacterium tumefaciens anthranilate synthase depicted in SEQ ID NO:4.
 33. The polynucleotide molecule of claim 1, wherein the tryptophan feedback insensitive anthranilate synthase polypeptide comprises an amino acid substitution at a position corresponding to position 293 in wild-type Agrobacterium tumefaciens anthranilate synthase depicted in SEQ ID NO:4.
 34. The polynucleotide molecule of claim 1, wherein the tryptophan feedback insensitive anthranilate synthase polypeptide comprises an amino acid substitution at a position corresponding to position 298 in wild-type Agrobacterium tumefaciens anthranilate synthase depicted in SEQ ID NO:4. 